US PAT NO: 5,051,364 [IMAGE AVAILABLE] L10: 1 of 3

TITLE: Anti-lipocortin-I and anti-lipocortin-II monoclonal
antibodies

ABSTRACT: 

The present invention provides monoclonal antibodies which are specific 
for one but not both of human lipocortin-I and human lipocortin-II, as 
well as cultures of hybridomas and other types of cells producing such 
antibodies. 

INVENTOR: Clare M. Isacke, La Jolla, CA 

Ian S. Trowbridge, San Diego, CA 

Tony Hunter, San Diego, CA 
DATE ISSUED: Sep. 24, 1991 
DATE FILED: Dec. 20, 1989 

US-CL-CURRENT: 435/240.27; 424/85.8, 85.91; 435/70.21; 530/387, 389 

US PAT NO: 4,935,233 [IMAGE AVAILABLE] L10: 2 of 3

TITLE: Covalently linked polypeptide cell modulators 

ABSTRACT: 

Described is a new class of polypeptide cell modulators characterized by 
being composed of two covalently linked cell modulators in a linear 
polypeptide sequence. Such dual function polypeptides have new and 
particularly useful activities when the component polypeptide cell 
modulators are interferons, lymphokines or cytotoxins which act through 
different and specific cell receptors to initiate complementary 
biological activities. 

INVENTOR: Leslie D. Bell, Thame, United Kingdom

Keith G. McCullagh, Princes Risborough, United Kingdom
Alan G. Porter, High Wycombe, United Kingdom

DATE ISSUED: Jun. 19, 1990

DATE FILED: Dec. 2, 1985

US-CL-CURRENT: 424/85.5, 85.6, 85.7; 435/69.51; 530/351

US PAT NO: 4,543,439 L10: 3 of 3

TITLE: Production and use of monoclonal antibodies to
phosphotyrosine-containing proteins

ABSTRACT:

A hybridoma cell line is disclosed that secretes monoclonal antibodies
which serve as a high titer, reproducible, biological reagent useful in
biological/medical research for isolating and identifying
phosphotyrosine-containing proteins. In addition, the antibodies have
potential uses in diagnosis of a variety of diseases, including certain
cancers. The antibodies, which have demonstrated affinity for a variety
of molecules containing o-phosphotyrosine residues, were prepared using a
synthetic analog, p-azobenzyl phosphonate (ABP) covalently linked to a
carrier protein, as the antigen.

INVENTOR: A. Raymond Frackelton, Jr., East Providence, RI

Herman K. Eisen, Waban, MA
Alonzo H. Ross, Bensalem, PA

DATE ISSUED: Sep. 24, 1985

DATE FILED: Dec. 13, 1982

US-GL-CURRENTs 435/78. £48.57, 948'j 935/98 
2/7/3

9©2a4 3ft BIOS IS Humber it 93© 134 3 a 

DIFFERENTIAL BINDING BIOLOGICAL AND BIOCHEMICAL ACTIONS OF RECOMBINANT
PDGF AA AB AND BB MOLECULES ON CONNECTIVE TISSUE CELLS

GROTENDORST G R; IGARASHI A; LARSON R; SOMA Y; CHARETTE M

DEP. CELL. BIOLOGY ANATOMY, UNIV. MIAMI SCH„ MO>. , P.O. mx «'if-QAt» mtsmt 
FLORIDA 3310.1 „ ""* 

J CELL PHYSIOL 149 (2). 1991. 235-243. CODEN: JCLLA

Full Journal Title: Journal of Cellular Physiology
Language: ENGLISH

We have compared the biological and biochemical properties of recombinant
PDGF AA, AB, and BB using three types of fibroblastic cells: NIH/3T3, human
skin fibroblast, and fetal bovine aortic smooth muscle. PDGF binding,
receptor autophosphorylation, phosphatidylinositol hydrolysis, as well as
chemotactic and mitogenic responses of the cells were analyzed. PDGF-AB and
PDGF-BB showed similar receptor binding, receptor autophosphorylation, and
potent biological activity for all three of the cell types tested. In
contrast, PDGF-AA was biologically active only for the NIH/3T3 cells in
which binding sites for PDGF-AA were abundant, but was inactive for bovine
aortic smooth muscle cells and human skin fibroblasts in which binding
sites for PDGF-AA were absent. PDGF-AA could not induce any biochemical
changes in the human skin fibroblasts or smooth muscle cells. Western blot
studies with anti-Type .alpha. and .beta. PDGF receptor antibodies indicate
that the NIH/3T3 cells contained both PDGF .alpha. and .beta. receptors
whereas the human skin fibroblasts and bovine smooth muscle cells contained
only detectable levels of .beta. receptors. These results indicate that
cells possessing high levels of PDGF .beta. receptors only are capable of
responding equally well to either PDGF AB or BB.

e/?/s 

&9S3.6S& BIOS IS Numbers 870 1 ££07 

KIDNEY EPITHELIAL CELLS EXPRESS C-SIS PROTOONCOGENE AND SECRETE PDGF-LIKE
PROTEIN

KARTHA S; BRADHAM D M; GROTENDORST G R; TOBACK F G

DEP. MED., BOX 453, UNIV. CHICAGO, 5841 SOUTH MARYLAND AVE., CHICAGO,
ILL. 60637.

AM J PHYSIOL 255 (4 PART 2). 1988. F800-F806. CODEN: AJPHA
Full Journal Title: American Journal of Physiology
Language: ENGLISH

Nontransformed monkey kidney cells (BSC-1 line), used as a model for
renal epithelium, were assayed for release of platelet-derived growth
factor (PDGF)-like proteins. BSC-1 cells continuously released a mitogenic
activity for fibroblasts and a chemoattractant activity for smooth muscle
cells, each of which was inhibited 80-90% by an antibody to human PDGF. A
cDNA probe for the PDGF B-chain gene (c-sis), but not for the A-chain gene,
hybridized to mRNA obtained from growing and quiescent cells. c-sis gene
expression and PDGF-like protein secretion were studied in the presence of
known growth regulatory molecules. A secreted BSC-1 cell protein identical
to transforming growth factor-.beta. inhibited DNA synthesis in growing
cultures and induced marked accumulation of c-sis mRNA without a
corresponding increase in the release of PDGF-like activity. Adenosine
diphosphate stimulated DNA synthesis in quiescent cultures and enhanced
both c-sis expression and release of PDGF-like activity. However, growing
and quiescent cells did not express the PDGF receptor gene or exhibit a
mitogenic response to authentic PDGF. Thus the PDGF-like protein released
by these kidney epithelial cells could contribute to growth control by a
paracrine mechanism.
?s pdgf(w)like not s2

1889 PDGF 
13438c! LIKE 

131 PDGF(W)LIKE

S3 130 PDGF(W)LIKE NOT S2
?s s3 and (endothel? or fibroblast?)

130 S3 

43997 ENDOTHEL? 

65919 FIBROBLAST?

S4 61 S3 AND (ENDOTHEL? OR FIBROBLAST?)
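
The narrowing steps above build each new set from earlier ones with Boolean
operators, a (W) adjacency operator, and "?" truncation. A minimal sketch of
that set logic follows, assuming a toy in-memory file: the records, the
helper names (tokens, term, adjacent), and the stand-in for set S2 are all
hypothetical and are not taken from the BIOSIS file or the Dialog software.

```python
import re

# Hypothetical toy records; a real Dialog set holds record numbers from the file.
RECORDS = {
    1: "PDGF-like protein secretion from endothelial cells",
    2: "PDGF like activity released by fibroblasts",
    3: "PDGF-like receptor ligand binding",
    4: "Growth factor-like protein from endothelium",
    5: "PDGF-like mitogen from kidney epithelial cells",
}


def tokens(text):
    """Upper-case word tokens, splitting on anything non-alphanumeric."""
    return re.findall(r"[A-Z0-9]+", text.upper())


def term(word):
    """Record IDs containing a word; a trailing '?' means open truncation."""
    word = word.upper()
    if word.endswith("?"):
        stem = word[:-1]
        return {rid for rid, text in RECORDS.items()
                if any(tok.startswith(stem) for tok in tokens(text))}
    return {rid for rid, text in RECORDS.items() if word in tokens(text)}


def adjacent(first, second):
    """(W)-style proximity: `first` immediately followed by `second`."""
    first, second = first.upper(), second.upper()
    hits = set()
    for rid, text in RECORDS.items():
        toks = tokens(text)
        if any(a == first and b == second for a, b in zip(toks, toks[1:])):
            hits.add(rid)
    return hits


s2 = term("receptor")                                # assumed stand-in for S2
s3 = adjacent("pdgf", "like") - s2                   # PDGF(W)LIKE NOT S2
s4 = s3 & (term("endothel?") | term("fibroblast?"))  # S3 AND (ENDOTHEL? OR FIBROBLAST?)
```

Each set is kept as a plain Python set of record IDs, so NOT, AND, and OR map
directly onto set difference, intersection, and union.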

?s s4 and (3S i 9^i4toX<iaaoerramri>iho&) or 33&<8w) (kcl or kilodalton?) or (36000 <£n) molec 
qu 1 ar ) or ( ( 36000 ( 2 n ) (mo 1 ecu 1 ar or re 1 at :i. ve or d a I ton? ) ) 

>>>Command is too complex -- please simplify
?s s4 and (34 or 36 or 36000)
5/6/1

9100187 BIOSIS Number: 93085187

EFFECTS OF T CELLS ON PLATELET-DERIVED GROWTH FACTOR-LIKE PROTEIN 
SECRETION FROM ENDOTHELIAL CELLS 

5/6/2 

7179912 BIOSIS Number: 88102657

RETINAL PIGMENT EPITHELIAL CELLS PRODUCE PDGF-LIKE PROTEINS AND SECRETE 
THEM INTO THEIR MEDIA 
?t s5/7/all

5/7/1

9100187 BIOSIS Number: 93085187

EFFECTS OF T CELLS ON PLATELET-DERIVED GROWTH FACTOR-LIKE PROTEIN
SECRETION FROM ENDOTHELIAL CELLS

SHADDY R E; HANSEN J C; COWLEY C G

100 N. MEDICAL DRIVE, SALT LAKE CITY, UTAH 84113.
J HEART LUNG TRANSPLANT 11 (1 PART 1). 1992. 48-56. CODEN: JHLTE
Language: ENGLISH

The cause of accelerated graft arteriosclerosis after heart
transplantation is unknown. To examine whether the interactions of T cells
and endothelial cells (ECs) could contribute to the cause of this
phenomenon, T cells were co-incubated with human umbilical artery
endothelial cells (HUAEC) and human vein endothelial cells (HUVEC) and the
resulting conditioned medium supernatant was assayed for the presence of
platelet-derived growth factor (PDGF)-like protein. PDGF-like protein
secretion was significantly greater from HUAECs co-incubated with T cells
at T-cell/HUAEC ratios of 30:1 (6.9 +- 1.1 fmol/10^6 ECs) and 10:1 (6.0 +-
1.1 fmol/10^6 ECs) than the combined background secretion from HUAECs
PDGF-<Lika protein P ZTr- " * " "7 f, """ > >""" " ' ,p , Bti BD , 

-* BMi ' * V - Bflt ' 1 / Plater, howevPT, thv iMtitr*-* ' " , ! * wf " rturtbu© was 

at fcall/HUVEC ratio, -*w 1* - ' co ™«foated with T cells 

HUAECs with , ga.ma,, inte, e "^M^/TrV^'^T^' P^^^ticn of 

actually caused a decrease in T celf i h„ iS SU?"*-, 8 *" "P™*^™ but 

above background iev.l, , * <? ! i l, r Ut ?° ^<f^ lke protein semtion 

nonstimulated HUftEC* <4.« - " V * r w , ' : / iMh tCs) whe " compared to 

P-tein secretion w „ ^ ^ 1 * ^ \ J. < *- 9S > -^"1 ik. 

maximum at 24 hours,, Th© co^1u,^„ 1 « th it T ~ i li,CrM$ed oveY ' to a 

aeration .of a very pnt » ^ t * ^ ' are *«P«ble of indueinq 

cells, Thi, cell-in „ e , ; « ; P ?T ;Ji* Vr ot " in «- endothelial 

primarily or sxciusively a W ^ K ... ..• D ' A V 4 f, P™*«i« appears to be 

r - 1 u H c . ul «i's,srui endothelial cells., 

5/7/2

7179912 BIOSIS Number: 88102657

RETINAL PIGMENT EPITHELIAL CELLS PRODUCE PDGF-LIKE PROTEINS AND SECRETE
THEM INTO THEIR MEDIA
Fvp FYP ■ r, « ^ ! ^v U Ii*fe£K. LW " rttl) " « CHARLOTTESVILLE, Vft„ Rf»ge,A 

Full Journal Title: Experimental Eye Research
Language: ENGLISH

H u m & " r- • t i « a 1 p i q m e n t e p :i. t h e 1 -i a i - 1 i : - 

condition serum free Dxnh*rlo>^LoZU'nVr* - Coy , itiu ^ ce was used to 

were exhaustively d i-a : v^d ai^ ^ J' d - " ' S . ? edlu «»- Conditioned media 
subjected to Western Wot ^r^rr- ? . * C ' tlC *? ld » lyophili*«d, and 

fraction prepared f TM ^ti^^ " *ri**ry antibody an :i: !;i n 

Pl^lBt derived qrowth far •'■•or w -i -i- T-",*,- i" 1 ! ■, . d ^ eCted human 

'^solved as a band with ir "r* ! m ' ^^sier-derived growth factor was 
bands „ith Mr of 3f,4l 5 "V," , ^ - ^ f r undeir «an--recfucin a conditions, while 

fpI^TTa l cells— so contTTKFB^^ -,r*o*. C J" °' ^tinal pigment 

for 48 hr exhibited much df-nCh^Tt-" ? K dnd mediA conditioned 
fanda were detected W h*„ n "i' ^ ! ^" , ? ond 24 hr„ Mo 

primary antibody and „h Pn Jordi-f --^ ' dCtl °" 1S W9rs Substituted for 

l^mf" fibroblast ii„p, thi^t;" W :;t d . mea3 - a ,r Te P-^P^-^^ from several 

m-U^nt epithelial cells/ o dit^h" "h^" r ? " >S " P-Pared from retinal 
IPa/ „ er e seen.. Retinal p i .men t « ; i?K i i,^ ' ^ " .. " ^ X . & ** 36 

If* Ni.he presence of r3f;RiL4 ( H ^ X' ^u-conduxoned media prepared 

!lro«th , factor lq P ^ - l0 * ded f n «"*i-pl«tel w t derived 

3ps-poiya>ryl ami de" el e T e n t ^ o n h r - ^ /' n ' 9i ^** d .'. av,d subjected to 

a„d \a k»a Wf =>;« vi^il ..Z I b,!t, ( , V B Wlth slightly less than 3S 

!P^- -onditVned m.d i a ari ii^ y ' ? ™. T !^ ^'Tj 

:?RXumn a 1 s.o\ marked 1 y ^irt^vVd f'^ J^"^ ^ , , ''^ fc;Lons ei"ted from the 
:»nthymidina, h ^ ; H t« * \ ~ ' *^ cheBIO **'«i» *"d ^corporation of 

"flti plateltt-d erivid ' nrnwtn -f:";"' t-^t, neutralized by soluble 

HSmant epithelial ,4 " i n ^ ' r " ! " H ; e£J& d * ta *"DB«t that retinal 

'^tor-lik. proteins ai d ^r^to^-L Pl^ltt-de.iv.d growth 

;: ? Pjbl, of :t:,,,ft,,n 3 M^o^ 1 ^,:- ^Jili f e^tion!^ ^ — 

■ 30n,ay92 IS « S5 8 5& Usore:i7743 Session 
• f ltd, 4 4 0„ US Hrs FileS 

*8. 68 a Typ©<si> in Format S 
*3.. 88 4 Type(s) i n Format 7 
«3„08 £, Types 
*i ,:5„4 4 Estimated cost F i f » !: " 

* X 4 . G 9 E » t i m atec! e o s t t h i « s p a -r> ^ h 

!^" 33 Estimated total s ess i ovi ' cost M r « 
pgotts level 29.81.02 D ISsSS^.Sfi "° " "'" 

if«...!ME'P !• call cleared by rocjueot 



istfu iaau awu -iuiu ^uisiu ^u^u ^utu 

AGATAGAATGACAGTCCGTCAAAACAGATTGTTTGCAAftGEGGAGGCAT CAGTGT — CTTGGCAGGCT 

lilt i ii I i t I I lit it ii I I i I I i i i i i i 

it i i i ii i ii ii ill ii il ii*ii i i i i t i 

AGGTACTAGCTGAGGTTATTTTAAAAG CAGCAGTGTGCCTACTTTTTGGAGTGTAACCGGGGAGG-G 

3240 3250 3260 3270 3280 3290 3300 

2050 20G0 2070 X 

GATTTCTAGGTAGGAAATG— TGGTAG— CTCACG 

i i • i • > itiiitii ii 

i ii tit tii t t I* i ii 

AAATTATAG CATGCTTGCAGACAGACCTGCTCTAGCG 

33 lO 3320 3330 3340 
HUMFGFAA Human fibroblast growth factor receptor mRNA, comp

LOCUS       HUMFGFAA     3901 bp ss-mRNA     PRI     15-MAR-1991
DEFINITION  Human fibroblast growth factor receptor mRNA, complete cds.
ACCESSION   M60485
KEYWORDS
SOURCE      Human, cDNA to mRNA, clone flg 5.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini;
            Catarrhini; Hominidae.
REFERENCE   1 (bases 1 to 3901)
  AUTHORS   Kiefer,M.C., Baird,A., Nguyen,T., George-Nascimento,C.,
            Mason,O.B., Boley,L.J., Valenzuela,P. and Barr,P.J.
  TITLE     Molecular cloning of a three-immunoglobulin-like-domain form of a
            human basic fibroblast growth factor receptor cDNA: Expression of
            biologically active extracellular domain in a baculovirus system
  JOURNAL   Unpublished (1991)
  STANDARD  simple staff_entry
FEATURES             Location/Qualifiers
     CDS             672..3134
                     /product="fibroblast growth factor receptor"
                     /gene="flg"
                     /codon_start=1
     sig_peptide     672..734
                     /gene="flg"
                     /codon_start=1
     mat_peptide     735..3131
                     /gene="flg"
                     /codon_start=1
BASE COUNT  839 a 1187 c 1139 g 736 t
ORIGIN



Initial Score    =  146   Optimized Score            =  878   Significance = 0.00
Residue Identity =  47%   Matches                    = 1050   Mismatches   =  912
Gaps             =  226   Conservative Substitutions =    0
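
The summary figures reported for HUMFGFAA can be cross-checked with a few
lines of arithmetic. The sketch below is illustrative only: the function
names are hypothetical, and the identity formula (matches divided by
matches + mismatches + gaps, truncated to a whole percent) is an assumption
that happens to reproduce the reported 47%, not a documented definition of
the alignment program's statistic. The base counts are the four figures
listed in the record (839 a, 1187 c, 1139 g, 736 t).

```python
def percent_identity(matches, mismatches, gaps):
    """Truncated percent identity over all aligned columns (assumed definition)."""
    aligned_columns = matches + mismatches + gaps
    return 100 * matches // aligned_columns


def parse_base_count(text):
    """Turn '839 a 1187 c ...' into {'a': 839, 'c': 1187, ...}."""
    fields = text.split()
    return {base: int(count) for count, base in zip(fields[0::2], fields[1::2])}


# Figures copied from the alignment summary and the GenBank base count.
identity = percent_identity(matches=1050, mismatches=912, gaps=226)
base_counts = parse_base_count("839 a 1187 c 1139 g 736 t")
sequence_length = sum(base_counts.values())
```

Under these assumptions the base counts add up to the 3901 bp stated for the
entry, which is a useful check that none of the four figures was misread.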



X lO 20 30 40 SO 

CCCGGC-CGA-CAGCCCCGAGACGACAGCCCGGCGCGTC-CCGGTCCCCACCTCCGACCACC 



GCGAGCGCGCCGCTGCTTGAAAAGCCGCGGAACCCAAG- 
170 180 190 200 



■GACTTTTCTCGGGT— CCGAGCTCGGGGCGCC 
2 1 0 220 230 



60 70 80 90 lOO HO 120 

GC-CA-GCGCTCCAGGCCCCGCGCTCCCCGCTCG — CCGCCACCGCGCCCTCCGCTCCGCCCGCAGTGCCAA 



i i 
i i 



CCGCA6GCGC— ACGGTACCCGTGCT— GCAGCTGGGCACGCCGCGGCGCCGGGGCCTCCGCAGGC- 
240 250 260 270 280 290 



■GCCGG 
300 



1 30 1 40 1 50. 

CCATGACCGCCGCCAGTATGGG? 



EC — 



160 .170 
— GTCCGCGTCGCCTTCGTGGT 



■ i 



180 190 
CTCGCCCTCTGCAGCCGGC 



CC— TGCGTTCTGGAGGAGGGGGGCACAAGGTCTGGAGACCCCGGGTGG— CGGACGGGAGCCCTCCCCCCGCC 



C-ATTTTTCCGTAAAAATGftTTTCAGTAGC-ACAAGTTATTTAAATCTGTTTTTCTA-AC-TGGGGGAAAAG 



CTATTTTT — G6GAAACT6GCA 
2500 25 1 0 



GAAACTCCCAGTCTCTCCCCTCAGA 
2520 2530 2540 



ATAAACACAAGTCAGATGAC 
2550 25G0 



1310 1320 1330 1340 1350 1360 1370 

ATTCCCACCCAATTCAAAACATTGTGCCATGTCAAACAAATAGTCTATCTTCCCCAGACACTGGTT-TGAAG 






ATAGGGCTAGTCTACAAAGGCTTGAGGAAGGCCACTCCCGTTGTAGTAATT — GCTGT TTCTCCTTCTG T 

2570 2580 2590 2600 2610 2620 2630 



1 380 1 390 1 400 1410 1 420 1 430 1 440 

AATGTTAAGACTTGACAGTGGAACTACATTAGTACAC AGCACCAGAATGTATATTAAGGTGTGGCTTT 






CTTCCTTAGA— AGGGCA — AAAAGT6CATCC6GACACCTAAAATCGCCAAGCCTGTCAAGTTTGAGCTTTCT 
2640 2650 2660 2670 2B80 2690 2700 



1450 1460 1470 1480 14SO 1500 1510 
AGGAGCAGTGGGAGGGTACCGGCCCGGTTAGTATCATCAGATCGACTCTTATACGAGTAAT-ATGC-CT 

i i i i t # i it i ii i itit it ii i i till t iiii 

I t I I I t I I t I II I IIII II II I t till f IIII 

GGCTGCACCAGTGTGAAG — ACATACAGGGCTAAGTTCTGCGGGGTG-TGCACAGACGGCCGCTGCTGCACA 
2710 2720 2730 2740 2750 2760 2770 

1 520 1 530 1 540 1 550 1 5GO 1 570 1 580 

GCTATTTGAAGTGTAATTGAGAAGGAAAATTTTAGCGTGCTCACTGACCTGCCTGTAGCCCCAGTGACAG-C 






CCGCACAGAACCACCACTCTGCCAGTGGAGTTCA-AATG — CCCCGA — TGGC — GAGATCATGAAAAAGAA 
2780 2730 2800 28 10 2820 2830 

1590 1600 1610 1620 1630 1640 1650 

TAGGATGTGCATTCTCCAGCCATCAAGAGACTGAGTCAAGT-TGTTCCTTA-AGTCAGAAC-AGCAGACTCA 
ii iiitt it ii ti t i i i i i i i i i i i i t i iiii 

It | I 11 It I I I III * t I I I f I I I IIII 

TATGATGTTCA TCAAGACCTGTGCCTGCCATTACAACTGTCCTGGGGACAATGACATCTTTGAGTCCCT 

2840 2850 2860 2870 2880 2890 2900 



1 660 1 670 

GCTCTGACATTCTGA — TTCGAATGACACTG- 



1680 1690 1700 1710 
TTCAGGAATCGGAATCCT6TCGATTAGACTGGAC- 






GTACT— ACA6GAAGATGTACGGA— GACATGGCGTAAAGCCAGGAA — GTAAGGGACACGAACTCATTAGACT 
2910 2920 2930 2940 2950 2960 2970 

1720 1730 1740 1750 1760 1770 1780 
AGCTTGTGGCAAGTGAATTTGCCTGTAACAAGCCAGATTTTTTAAAATTTA— TATTGTA— AATATTGTGT 



ATAACTTG- 
2980 



•AACTG— AGTTGCATCTCATTTTC- 
2990 3000 



TTCTGTAAAAACAATTACAGTAGCACATT A 

30 1 O 3020 3030 



1790 1800 1810 1820 1830 1840 

GTGTGTGTGTGTGTGTATATATATATATATATGTACAGT — TATCTAAGT — TAATTTAAAGTTGTTTGTGC 






ATTTAAATCTGTGTTTTTAACTACCGT6GGAGGAACTATCCCACCAAAGTGAGAACGTTA TGTCATGGC 

3040 3050 3060 3070 3080 3090 3100 

JO I860 1870 1880 1890 1900 19 10 

CTTTTTATT — TTTGT — TTTTAATGCTTTGATATTTCAATGTTAGCCT— CAATT— TCTGAACACCA— TAG— 






CATACAAGTAGTCTGTCAACCTCAGACACTG — GTTTCGA— GACAGTTTACACTTGACAGTTGTTCATTAGC 
3110 3120 3130 3140 3150 3160 3170 



1920 1930 1940 1950 1960 1970 

GTAGAATG TAAAGCTTGTCTGATCGTTCAAAGCATGAAA TGGATACTTATATG— GAAATTCTGCTC 






GCACAGTGCCAGAACGC— ACACTGA— GGTGAGTCTCCTGGAACAGTGGAGA— TGCCAGGAGAAAGAAAG— AC 
3180 3190 3200 3210 3220 3230 



480 490 500 Z>1V Z>3V t^4U 

TTCCAGAGCAGCTGCAAGTACCAGTGCACGTGCCTGGACGGGGCGGTGGGCTGCATGCCCCTGTGCAGCATG 

• •■It t I ■ ■ I I I I I I * (111* I I I t • I I I I I I « I I • I I I I I I I I I I I I filltll I I I I I I I I t 

I I I I I | | | | t I I I I I I I I I I ■ I I I I I I I I I I I 1 I t I I ■ I ■ I I I I t ■ I 1 I I I I I I I I I I I « I t I I 

TTCCAAAGCAGCTGCAAATACCAATGCACTTGCCTGGATGGGGCCGTGGGCTGCGTGCCCCTATGCAGCATG 
1 T40 1 750 1 T60 1 770 1 780 1 790 1 800 



550 560 570 580 590 600 6 1 0 

GACGTTCGTCTGCCCAGCCCTGACTGCCCCTTCCCGAGGAGGGTCAAGCTGCCCGGGAAATGCTGCGAGGAG 

i i i i i it ■ ■ i t i i i i • i t i i t i i i i i i i i i i i i i i t i i i i i t t i i i i i i i iiiittiiitti i i i i ■ 
i i i i t it i i i t i i i t ■ ■ i i i i t ■ i i i t i t i i i i i i i i i i i i t ■ i i i i i i i i i i t t i i i i i i ■ i t i i i 

GACGTGCGCCTGCCCAGCCCTGACTGCCCCTTCCCGAGAAGGGTCAAGCTGCCTGGGAAATGCTGCAAGGAG 
1810 1 820 1 830 1 840 1 850 1 860 1 870 



620 630 640 650 660 670 680 
TGGGTGTGTGACGAGCCCAAGGACCAAACCGTGGTTGGGCCTGCCCTCGCGGCTTA— CCGAC TGGAAG 

t I I 1 I I I I t 1 I I I I I I I I I I I I I I I II I till! I t I I I < I I II I I I I I I III 

I I | I I I I I I I I I I I t t I I I I I I I I I II I I I I I I I I I I t • I 1 II I I I • * I III 

TGGGTGTGTGACGAGCCCAAGGACCGCACAGCAGTTGGCCCTGCCCTAGCTGGTGAGTTGTCTCCTTCAAAG 
1880 1890 1900 1910 1920 1930 1940 



690 700 7 1 0 720 730 740 750 

ACACGTTTGGCCCAGACCCAACTATGATTAGAGCCAACTGCCTGGTCCAGACCACAGAGTGGAGCGCCTGTT 

iiii ii t* ii » t i i i i i i it iiit tittt iii tit 

tiii ii t t ii i i i i i i i i it iiit i i i i i i t i tit 

TTAC— TGTTATTCATTCTC— CCCAACTTCAG— GCCAATGCCCAAGTCC — ACCAAATTAAGGGGAAATTGTC 

1 950 1 960 1 970 1 980 1 990 2000 20 1 O 

760 770 780 790 800 810 820 

CCAAGACCTGTGGGATGGGCATCTCCACCCGGGTT— ACCAATGACAAC— — GCCTCCTGCAGGCTAGA GA 

ii iii i i i i i iiii i i iii i ii iii i i it i 

ii iii i i tit i iii t i iii i ti iii i i ii i 

CTA — TCCGGATGTTTTACCTTGTGTTTGTGTGTTCTGCTCTCACAGCCTACCGACTGGAAGACACATTTGG 
2020 2030 2040 2050 2060 2070 2080 

830 840 850 860 870 880 890 

AGCAGAGCCGCCTGTGCATGGTCAGGCCTTGCGAAGCTGACCTG— GAAGAGAACATTAAGAAGGGCAAAAAG 

t i t i t t t t t t itt t iii i i t i i i t tit lit i i i i i 

■ i t i i i ii it iii i tti i i i i i t t i t i tii i i i t i 

CCCAGACCCAACTATG-ATG — CGAGCC AA— CTGCCTGGTCCAGACCACAGAGTGGAGCGCCTGTTC 

2090 2100 21 lO 2120 2130 2140 



900 910 920 930 940 950 960 

T— GCATCCGTACTCCCAAAATCTCCAAGCCTATCAAGTTTGAGCTTTCTGGCTGCA— CCAGCATGAAGACAT 

■ i t ii i i i t i i i i ii iii it i ■ i t i i i i ttii it 

i i i it i t i i i i i i ii iii it i i i i i i i i i i i i i i 

TAft6ACCTGTGGAATGGSCATCTCCACCCGAGTTACCAATGIACAATACCTTCTGCAGACTGSA-SAAG-CAG 
2150 21G0 2170 2180 2190 2200 2210 



970 980 990 lOOO 1010 1020 1030 

A-CCG-AGCTAAATTCTGTGGAGTATGTACCGACGGCCGATGCTGCACCCCCCACftGAACCACCACCCTGCC 

i til li ti t t i ii i Itt l i t t t it tit i ti 

iiit ii ii l ii it i ill il ii • it ill i It 

AGCCGCCTCTGCATGGTCAGG— CCCTGCGAAGCTGACC — TGGAGGAAAACATTAAGGTACATCCTCTGCCC 
2220 2230 2240 2250 2260 2270 2280 

1 040 1 050 1 060 1 070 1 080 1 090 1 1 OO 

GGTGGAGTTCAAGTGCCCTGACGGCGAGGTCATGAAGAAGAACATGATGT— TCATCAAGACCTGTGCCT 

• ii i iii i iii i i iii ■ iti iii i iiii i 

■ ii i tii i iii i i i t i i iii lit i iiii i 

CTAGTCACTCCGTTTTACAGAATGACAGGGAAGAGAACCGAGCTGGCTGTCTCA— CCTCCCATGTTATTAGA 

22SO 2300 23 lO 2320 2330 2340 23SO 



1110 1120 1130 1140 1150 1160 1 170 

GCCATTACAACTGTCCCGGAGACA— ATGAC— ATCT — TTGAATCGCTGTACTACAGGAAGATGTACGGAGAC 

tii it it ii i i iii t it i tit ti t i i i i i ■ it 

■ it ii i i i t i i ill i ii iiii ti t i i I t I I ii 

GGCCTGTTGTCT CCAGAAATATCTAACCGTGGAGCTGTCTGGCTAGAATGAGAGATGCTGTAACAACAG 

2360 2370 2380 2390 2400 24 1 0 2420 



1 1 SO 1 1 90 1 200 1 2 1 O 1 220 1 230 

ATGGCA TGAAGCCAGAGAGT GAGAGACATTAACTCATTAGACTGGAACTTGAACTGATTCACATCT 

I I t I I I I I t I It tl lit! I I III llltl I III 

lilt I IIII I _ It II fill I I _ III II III I til 

CTGCCAGTTTTCCACTACAAAA"^fccCCGTGGTGCTAGTTAATTCAA q^piCTCCAAATGAGGC-CATGG 

2430 2440 2<^TO 2460 2470 2480 2490 
                     /gene="fisp-12"
     exon            1675..1926
                     /number=3
                     /gene="fisp-12"
     intron          1927..2058
                     /number=3
                     /gene="fisp-12"
     exon            2059..2270
                     /number=4
                     /gene="fisp-12"
     intron          2271..2641
                     /number=4
                     /gene="fisp-12"
     exon            2642..4006
                     /number=5
                     /gene="fisp-12"
BASE COUNT  1061 a 1039 c 984 g 1044 t
ORIGIN      chromosome 10



Initial Score 
Residue Identity 
Gaps 



498 Optimized Score = 954 Significance 
50% Matches « 1104 Mismatches 

2 1 1 Conservat i ve Subst i tut i ons 



0. OO 

;s 

o 



X lO 20 30 40 SO 

CCCGGCCGACAGCCCCG-AGACGACAGCCCGGCGC-GTCCCGGTCC-CCACCT-CCG-ACCA 

ti i ii • t i i i i i it iiii it ii i i i t i i it iii * 
ti i ii i i i * i i i ii tiii ii it i i i t i i ii tit t 

CAGCCGAAGCAGCGCCGCACTGCCCCGCCGGCGTGAGCCTGGTGCTGGACGGCTGCGGCTGCTGCCGCGTCT 

1250 X 1260 1270 1280 1290 1300 1310 



60 70 80 90 lOO HO 120 

CCGCC-AGC-GCTCCAGGCCCCGCGCTCCCCGCTCGCCGCCACCGCGCCCTCCGCTCCGCCCGC-AGTGCCA 

• i i i i i t i t i i it* it it ti i iii ii it tiii i i i i 

lift tit tit t iii ii t i t i i i i t t i it i i i i tiii 

GCGCCAAGCAGCTGGGAGAACTGTG — TACGGAGCGTGACCCCTGCGACCCACACAAGGGCCTCTTCTGCGA 

1320 1330 1340 1350 1360 1370 1380 



130 140 ISO 1GO 170 180 190 

— ACCATGACCGCCGCCAGTATGGGCCCCGTCCGCGT— CGCCTTCGT — GGTCCTCCTCGCCCTCTGCAGCCG 

t It I I I I I I t t 1 ■ til t t It! t III* tilt I I I t I I 

t tttlllll t II I III It III I till till I I I I I I 

TTTCGGCTCCCCCGCCA — ACCG— CAAGATTGGAGTGTGCACTGGTAAGACCCTCAGC— CCCATTCCAGCCC 
1390 1400 14 lO 1420 1430 1440 1450 



200 21 0 220 230 240 250 260 

GCCGGCCGTCGGCCAGAACTGCAGCGGGCCGTGCCGGTGCC— CGGACGAGCCGGCGCCGCGCTGCCCGGCGG 

t i tiii iii tit i it tti itit i ii t • tti i 

i i itit itt tit i ii tit i t i t i ti t i iii i 

CCTTTGCAGAGGCCTCACCTTTTG-GTGTTGGACCACCACCTCTCTCAAGTC — CAGCGTGATACCC-TCTA 
14G0 1470 1480 1490 1500 1510 



270 280 290 300 310 320 330 

GCGTGAGCCTCGTGCTGGACGGCTGCGGCTGCTGCCGCGTCTGCGCCAAGCAGCTG-GGCGAGCTGTGCACC 

i ii t t I i t I I I I I t 1 tt I it t III ■ tilt till 

I t t I I I tl it ill I I ii 1 ii ■ tit I lilt I I I t 

GAAAAAGAAAAGCCCCTATCCGCAGCTGCTTCCAACCGGCCCCCTGCAGTC — CTGACCCTAGCTCGTCACC 
1520 1530 1540 1550 1560 1570 1580 



340 350 360 370 380 390 400 

GAG— CGCGACCCCTGCGACCCGCACAAGGGCCTCTT — CTGTGA CTTCGGCTCCCCGGCCAACCGCAAGA 

!• i i ti t it i ii i i i i i i i ititit it <ii i 

it i i t i i ii i ii t i i i i i i i t i t i i it iii i 

TTGACATGTACAGTGATA— TAGCTAGCTGTTCTGATCCCTGTGACCCTACGCCTGACC — TCTACAACTTTG 

15SO IGOO 1G10 1620 1G30 1640 1650 



410 420 430 440 450 460 470 

TCGGCGTGTGC— ACCGCCAAAGATGGTGCTCCCTGCATCTTCGGTGGTACGGTGTACCGCAGCGGAGAGTCC 

ti i t i i t ttititiiiittti iiiii • i i i i t i i • i ttitiiiiiiiiiiii i i t i t i 

ii t i i i t itttttiitiiiti tiiti tiiiiittit t i i t i t l i i i i ) i i • i 1111(1 

TCTTCCTCTCCTGCAGCCAAAGATGGTGCACCCTGTGTCTTCGGTGGGTCGGTGTACCGCAGCGGTGAGTCC 
1GS0 1670 1680 1690 1700 1710 1720 1730 



Ml t L3 I MMH IMI IUI UUH IHIHtMl MIMI MIMIMIM1HI I HOHLa I IHIUI H~HL» I I HH I I f HHRL? I UH I I 

1770 1780 1790 1800 1810 . 1820 1830 



1 840 1 850 1 860 1 870 1 880 1 890 1 900 

TAAAGTTGTTT GTGCCTTTTTATTTTTGTTTTTAATGCTTTGATATTTO^ATGTTAGCCTCAATTTCTG 

• ttiiti tiitiit i i i i ii ■■ * i i i i i i iii i i i 

■ i t i i i i i i i i i i i iiii ii ii i t i i iii iii i i i 

TQTTTTTGTTTTAftGTGCTTTTGGGATTTTAAACTSftTAGC-CTCAftACTCCAA — ACACCAT — AGGTAGG 

1840 1850 18G0 1870 1880 1890 1900 

1910 1920 1930 1940 1950 I960 1970 

A-AC — ACCATAGGTAGAATGTAAAGCTTGTCTGATCGTTCAAAGCATGAAATGGATACTTATATGGAAATT 

til iiii i ti iiii ill lit i ■ I t i i tti i ii i 

111 I I I I I II III! III lit I I III I III 1 II I 

ACACGAAGCTTATCTGTGATTCAAAACAAAGGAGATACTGCAGTG GGAATTGTGACCT— GAGTGACTCT 

1910 1 920 1 930 1 940 1 950 1 960 1 970 

1 980 1 990 2000 20 1 O 2020 2030 

CTGCTCAG ATAGAATGACAGTCC GTCAAAACAGAT— TGT— TTGCAAAGGGGAGGCATC— AGT— GTCT 

(■I iiii i i ii i ii ■ ii i ii ii i iii iii it i ii ti iii i 
iii iiii i i it i ii i ii i ii it i iii iii it i ii t i ii* i 

CTG— TCAGAACAAACAAATGCTGTGCAGGTGATAA— AGCTATGTATTGGAA — GTCAGATTTCTAGTA6GAA 
1980 1930 2000 2010 2020 2030 2040 

2040 2050 2060 2070 X 

TGGCAG6CTGAT — TTCTAGGT— A6GAAATGTGGTAGCTCACG 

i ii it i i i i i i iitit i iti 

i i i it i i i i i i i i t i t i t t t 

ATGTGGTCAAATCCCTGTTGGTGAACAAATGGCCTTTATTAAGAAATGGCTGG 
2050 2060 2070 2080 X 2090 



2. US-07-752-427-1 (1-2075)

MUSFISP12A Mouse FISP-12 protein (fisp-12) gene, complete cds

LOCUS       MUSFISP12A   4128 bp ds-DNA     ROD     09-JUL-1991
DEFINITION  Mouse FISP-12 protein (fisp-12) gene, complete cds.
ACCESSION   M70641
KEYWORDS    cysteine-rich protein; growth factor-inducible gene.
SOURCE      Mus musculus (sub_species domesticus) liver/kidney DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1 (bases 1 to 4128)
  AUTHORS   Ryseck,R.-P., MacDonald-Bravo,H., Mattei,M.-G. and Bravo,R.
  TITLE     Structure, mapping and expression of fisp-12, a growth factor
            inducible gene encoding a secreted cysteine-rich protein
  JOURNAL   Cell Growth Differ. 2, 225-233 (1991)
  STANDARD  full staff_entry
FEATURES             Location/Qualifiers
     CDS             join(1052..1114,1202..1424,1675..1926,2059..2270,
                     2642..2938)
                     /product="FISP-12 protein"
                     /gene="fisp-12"
                     /codon_start=1
     mRNA            join(833..1114,1202..1424,1675..1926,2059..2270,
                     2642..4006)
                     /gene="fisp-12"
     TATA_signal     798..804
                     /gene="fisp-12"
     exon            833..1114
                     /number=1
                     /gene="fisp-12"
     intron          1115..1201
                     /number=1
                     /gene="fisp-12"
     exon            1202..1424
                     /number=2
                     /gene="fisp-12"
     intron          1425..1674




1010 1020 1030 1040 1050 1060 1070 




1070 1080 1090^ UOO lllO 1 12D 1130 1140 

ARGAAGAACATGATGTTCATCAAGACCT3TGCCTGCCATTACAACTG!TCCCGGAGACAATGACATCTTTGAA 

> i t t t i i i i i t i i i t i i i t i * i i t i i ■ i i i t i t * t i i i t » i i i i i t i i ii i i » i i i t i i i t i t » i i * 
t i i t i i i t i i i i i t ■ t i i i i i ■ i i i i i i i t t i i i i i i i i ■ i i i ii i i t ■ t t i i i i i i t i ■ i t 

AAAAAGAATATGATGTTCATCAAGACCTGTGCCTGCCATTACAACTGTCCTGGGGACAPiTGACATCTTTGAG 
1080 lOSO 1 lOO 1110 1 120 1 130 1 140 

1150 1160 1170 U80 1190 1200 1210 

TCGCTGTACTACAGGAAGATGTACGGAGACATGGCATGAAGCCAGAGAGTGAGAGACATTAACTCATTAGAC 
*• • i • i i • i i i i i t i i i i i • i » i i i i » i i i i i i i t * t i * * i i iii it ii*i ■ ■ « i * i i i i • i ■ 

• • i • « i t i * * ■ ■ i * t i i i i i i i i i i i i i t i ■ i i i i i * i i i i i iii it i i i i i i i * i i i i t i i i 

TCCCTGTftCTftCftGGftftGftTGTftCGGftGft CftTGGCG TftftftGCCftGGftftGTftftGGGftCftCGftftCTCATTftGftC 
1150 1160 1170 1180 I, 1190 p 1200 1210 

1220 1230 1240 125<7 12GO 1270 1280 

TGGAACTTGAACTGATTCACATCTCATTT TTCCGTAAAAATGATTTCAGTAGCACAAGTTATTTAAATCT 

■ I I l i » I I I i I I i i t i i i i i i i i i ill t i i i i i i lit iititlttt l t I t i ■ i i i i i i 

• I t l l l t I I I I t l I I i t | i t i i | I tit t I t t I t l III t t I I i | | t | i t i i i i i i i i t i 

TATAACTTGAACTGAGTTGCATCTCATTTTCTTCTGTAAAAACAATTACAGTAGCAC-ATTAATTTAAATCT 
1220 1230 1240 1250 1260 1270 1280 

1 290 1 300 1310 1 320 1 330 1 340 

GTTTTTCTAACT GGGGGA— AAAGATTCCCACCCAATTCAAAACATTGTGCCATGTCAAACAAATAGT— C 

• ■ (If tlltl t till II I • I I I I I |, | | | lit II l| III! I I I III 

• I til t I I f I I till II I t t I I I I I| | | lit II II till I I I III 

GTGTTTTTAACTACCGTGGGAGGAACTATCCCACCAAAGTGAGAACGTTATGTCATGGCCA — TACAAGTA6 
1230 1300 1310 1320 1330 1340 1350 

1350 1360 1370 1380 1330 1400 1410 

TATCTTCCCCAGACACTGGTTTGAAGAATGTTAAGAC — TTGACAGTGGA— ACT ACATTAGTACACA6C 

i i i ii tii t i it t i t i i t t i ■ i t i t i t t t • i ■ t i t t i 

it* ii til ti t t titi i t t t i i t i t t t t t i i i titit 

TCTGTCAACCTCAGAC ACTG GTTTCGAGACAGTTTACACTTGACAGTTGTTCATTAGCGCACAGT 

1360 1370 1380 1390 1400 1410 1420 

1 420 1 430 1 440 1 450 1 4GO 1 470 1 480 

ACCAGAATGTATATTAAGGTGTGGCTTTAGGAGCAGTGGGAGGGTACCGGCCCGGTTAGTATCATCAGATCG 

iitiit * i i t i i i i i t ii tii tii* i i i i i ii i i I* i t tit • 

• i i i i i i i i i i i t t i t it iii i t t i i i t i t it i i it i t i t i t 

GCCAGAACGCACACTGAGGTGAGTCTCCTGGAACAGT— GGAG— ATGCCAG — GAGAAAGAAAGA-CAGGT — 
1430 1440 1450 1460 1470 1480 

1490 1500 1510 1520 1530 1540 1550 

ACT — CTTATACGAGTAATATGCCTGCTATTTGAAGTGTAATTGAGAAGGAAAATTTTAGCGTGCTCACTGA 

• ti tii ti i tti iii i ttt tit tti tit 

• it tii tt i iti lit t tit tit tti tit 

ACTAGCTGAGGTTATTTTAAAAGCAGCAGTGTGCCTACTTTTTGGAGTGTAACCGGGGAGGGAAATTA— TAG 
1490 1500 1510 1520 1530 1540 1550 

1560 1570 1580 1590 1600 1610 1620 

CCTGCCTGTAGCC CCAGTGACAGCTAG— GATGTGCAT— TCTCCAGC— CATCAAGAGACTGAGTCAAGTT 

i ill t i it i it t ill it i it ittt i tti i i i iti i i i i i i t ii i 

I lit ii ii I it I til ti l t I i I t I I 1 t i t i t til t I I I t i t it i 

CATGCTTGCAGACAGACCTGCTCTAGCGAGAGCTGAGCATGTGTCCTCCACTAGATGAGGCTGAGTCCAGCT 
15GO 1570 1580 1590 1600 1610 1620 1630 

1 630 1 640 1 650 . 1 660 1 670 1 680 1 690 

GTTCCTTAAGTCAGAACAGCAGACTCAG— CTCTGA— CATTCTGATTCGAATGACACTGTTCAGGAATCGGAA 

• * * * tii t t t t i i i t i t ttii ■ i i t t t i t t t i i i t i i i t t t i i t i t t i t t i t ti ii 

• i i i iii i i i i i i t i i i tiii i i i i t t i t t i t i t t t i i t i i ■ i i i t i ■ i i ■ i ti ti 

GTTCTTTA AGAACAGCAGTTTCA6CCTCTGACCATTCTGATTCCAGTGACACTTGTCAGGAGTCAGAG 

1640 1650 1660 1670 1680 1690 

1700 1710 1720 1730 1740 1750 1760 

TCCTGTCGATTAGACTGGACAGCTTGTGGCAAGTGAATTTGCCTGTAACAAGCCAGATTTTTTAAAATTTAT 

' i « t i ttttttitiiiititittiiittti i * i t i i i i i i i t • i i i t t t i • i i t i i ttt it 

* i i t t i t t t t i t t t i i t t i t t i t i t i t t t t i t i t i t i i i i t t i i t i t t t l i t t i i t lit t t 

CCTTGTCTGTTAGACTGGACAGCTTGTGGCAAGTAAGTTTGCCTGTAACAAGCCAGATTTTT ATTGAT 

1 700 1710 1 720 1 730 1 740 1 750 1 760 



1 770 1 780 1 790 1 800 1 8 1 O 1 820 1 830 

ATTGTAAATATTGTGTGTGTGTGTGTGTGTGTATATATATATATA — TATGTACAGTT— ATCTAAGTTAATT 



210 220 230 240 250 260 270 280

280 290 300 310 320 330 340

GTGCTGGACGGCTGCGGCTGCTGCCGCGTCTGCGCCAAGCAGCTGGGCGAGCTGTGCACCGAGCGCGACCCC



• ■•11 11 II 



GTCfcTdGACGGCTGCpGCfrGCT 

290 300 310 320 330 340 350 

350 360 370 380 390 400 410 420

TGCGACCCGCACAAGGGCCTCTTCTGTGACTTCGGCTCCCCGGCCAACCGCAAGATCGGCGTGTGCACCGCC

• i i i t i i i i • i i • • i i • * i t i • t i i it ititttitiit i t > i • i i • • i t i > i ii titiiiii iii 
i i i i i t » i i i i i t i i i i i i i i i i i i ti i i i t i § i t i i i t i t i i t i i i i i i i i ii i i i i i i i i iii 

TGqGACfCCAfcAdAAdGGC|CTC^ 

360 * 370 380 390 400 410* 420 



430 440 450 460 470 480 490 

AAAGATGGTGCTCCCTGCATCTTCGGTGGTACGGTGTACCGCAGCGGAGAGTCCTTCCAGAGCAGCTGCAAG 

iitiiiiiiii tiiti t i i i i • i • i i i i i i i * i • • i i i i • i t i i i i • i i i • i i itiiiiiiiit 
i i i i i i i i i i i i i i t t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i 

AAAGATGGTGCACCCTGTGTCTTCGGTGGGTCGGTGTACCGCAGCGGTGAGTCCTTCCAAAGCAGCTGCAAA 
430 440 450 460 470 480 490 

500 510 520 530 540 550 560 

TACCAGTGCACGTGCCTGGACGGGGCGGTGGGCTGCATGCCCCTGTGCAGCATGGACGTTCGTCTGCCCAGC 

I l l l I I I I I I t I I I I I I I i I i I i i l l l l l l l l l i l l l l l I I l l I I l I t I I l l l II l l l • l I l l l 
I t I I I I I I I I I I I » I I I I I I I I I i t t I I I I I I I I I I t I I I I I I I I I I I I I I I I II t I I I I I I I I 

TACCAATGCACTTGCCTGGATGGGGCCGTGGGCTGCGTGCCCCTATGCAGCATGGACGTGCGCCTGCCCAGC 
500 510 520 530 540 550 560



570 580 590 600 61 0 620 630 

CCTGACTGCCCCTTCCCGAGGAGGGTCAAGCTGCCCGGGAAATGCTGCGAGGAGTGGGTGTGTGACGAGCCC 

• i i i i i i i i i i • * i * i i i i i t i i t i t i « i i i • i i i * i i • t i i i i i i t t i i • • t i i i i t i i • i i i « i i i i 

• i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i 

CCTGACTGCCCCTTCCCGAGAAGGGTCAAGCTGCCTGGGAAATGCTGCAAGGAGTGGGTGTGTGACGAGCCC 
570 580 590 600 610 620 630 640



640 650 660 670 680 690 700 

AAGGACCAAACCGTGGTTGGGCCTGCCCTCGCGGCTTACCGACTGGAAGACACGTTTGGCCCAGACCCAACT 

i i i i i i i ii i iiiti i i i i i • t i ii tt i i i i t i t i i i • i t i i I i i i i t t i i i i i i t i i i i i i 

i i i i i i i il t i ■ i i i i t i i i ' i i i t l ti i i i i i i i i i l i i i i i t i i t t i i i i t i i i i i l l i i l 

AAGGACCGCACAGCAGTTGGCCCTGCCCTAGCTGCCTACCGACTGGAAGACACATTTGGCCCAGACCCAACT
650 660 670 680 690 700 710



710 720 730 740 750 760 770 780

ATGATTAGAGCCAACTGCCTGGTCCAGACCACAGAGTGGAGCGCCTGTTCCAAGACCTGTGGGATGGGCATC 

I i i i i i i i i I • t t i I i • i • i i t t I I i i • I I I I I I i i t i i I I I I i t I • i I i i i i I I 1 I I I I i i I i t I I I 

I t i I i I I t t i i i i t i I I i i i i t t t i i i i i i i i i i i i I ■ • i i t t i i i i i i i i i i i i t i i i i ■ i i i i i i i 

ATGATGCGAGCCAACTGCCTGGTCCAGACCACAGAGTGGAGCGCCTGTTCTAAGACCTGTGGAATGGGCATC 
720 730 740 750 760 770 780 



790 800 810 820 830 840 850 

TCCACCCGGGTTACCAATGACAACGCCTCCTGCAGGCTAGAGAAGCAGAGCCGCCTGTGCATGGTCAGGCCT 

i i t i i i i i i i i i i • i i i i i t i i iii t i i i i i it t i i i i i i t i i i i i t i i i i i i i i t § i i t i i t i 
i i i i i i i i i i i i i i i i i i i i i t iii i i i i t i ti i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i 

TCCACCCGAGTTACCAATGACAATACCTTCTGCAGACTGGAGAAGCAGAGCCGCCTCTGCATGGTCAGGCCC

790 800 810 820 830 840 850 



860 870 880 890 900 910 920 

TGCGAAGCTGACCTGGAAGAGAACATTAAGAAGGGCAAAAAGTGCATCCGTACTCCCAAAATCTCCAAGCCT 

i I t l i I i i i t t i i i i i ■ ii t i t t i t I I i i i t i i i I t i i t I I I * i i I i • ii ii i i I i i i i I i i t i t i 
t i i i I I I t 1 i • i i i i i i it i i i I i i i t i t I I i i i t I i i i I i I t t I i I i ii ii itilit I i t I I t I t 

TGCGAAGCTGACCTGGAGGAAAACATTAAGAAGGGCAAAAAGTGCATCCGGACACCTAAAATCGCCAAGCCT 
860 870 880 890 900 910 920 



930 940 950 960 970 980 990 

ATCAAGTTTGAGCTTTCTGGCTGCACCAGCATGAAGACATACCGAGCTAAATTCTGTGGAGTATGTACCGAC 

tititiiitititiiiitittiiittii tttiiitiiti i i i t » i iiiii *i I* it it ttt 
• i i i i t i i i i t i t i i i t i i i i t i i i i i i i > i t i i i i i i i i i i i i i iiiii ti it ii ii iii 

GTCAAGTTTGAGCTTTCTGGCTGCACCAGTGTGAAGACATACAGGGCTAAGTTCTGCGGGGTGTGCACAGAC

930 940 950 960 970 980 990 1000

1000 1010 1020 1030 1040 1050 1060

ggccgatgctgcaccccccacagRccaccaccctgccggtggagti'caaotgccctgacggcgaggtcatg 

lllll I I I I 1 I I I II I t • I I I I I I I I t I I IIIII I I t I I I 1 I I I t IIIII II I I I I I I tllll 
• til* lllttlll tl I I I I I I I I I I I I I I IIIII Itllttllttl IIIII II I I I I I I I I I t I 











10. HUMHLAA1     Human HLA-A1 gene                 3840    140    855   0.00
11. HSSGPGH      Pseudorabies virus glycoprote     324^    134    853   0.00
12. HSSPVGH      Pseudorabies virus glycoprote     342^    135    852   0.00
13. HS5HCMVCG    Human Cytomegalovirus Strain    229354    158    852   0.00
14. MUSCYR61A    Mouse Cyr61 mRNA, complete cd     2018    408    848   0.00
15. HSSRICP18    Pseudorabies virus ICP18.5 ge     2524    168    846   0.00
16. MUSIL5G      Murine eosinophil differentia     6727    130    844   0.00
17. HUMMHEA      Human MHC class I lymphocyte      4938    129    842   0.00
18. HSEIEP       Equine herpesvirus type 1 imm     8174    131    840   0.00
19. AVHNIFREG    Azotobacter chroococcum nifU,     7099    134    838   0.00
20. SERERYA      S. erythraea eryA gene for 6-     9769    126    838   0.00



1. US-07-752-427-1 (1-2075)
   MUSFISP12B  Mouse FISP-12 protein (fisp-12) mRNA, complete cds

LOCUS       MUSFISP12B   2267 bp ss-mRNA   ROD   09-JUL-1991
DEFINITION  Mouse FISP-12 protein (fisp-12) mRNA, complete cds.
ACCESSION   M70642
KEYWORDS    cysteine-rich protein; growth factor-inducible gene.
SOURCE      Mus musculus (sub_species domesticus) liver/kidney cDNA to mRNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1 (bases 1 to 2267)
  AUTHORS   Ryseck,R.-P., MacDonald-Bravo,H., Mattei,M.-G. and Bravo,R.
  TITLE     Structure, mapping and expression of fisp-12, a growth factor
            inducible gene encoding a secreted cysteine-rich protein
  JOURNAL   Cell Growth Differ. 2, 225-233 (1991)
  STANDARD  full staff_entry
FEATURES          Location/Qualifiers
     CDS          138..1184
                  /product="FISP-12 protein"
                  /gene="fisp-12"
                  /codon_start=1
     polyA_signal 2224..2230
                  /gene="fisp-12"
BASE COUNT  570 a 569 c 582 g 546 t
ORIGIN      chromosome 10
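
As a quick consistency check on the feature table of this record, the CDS span 138..1184 can be converted to a length in bases and codons. This is only a minimal sketch: the helper name `cds_length` is hypothetical, and it handles nothing beyond a plain `start..end` location.

```python
def cds_length(location: str) -> tuple[int, int]:
    """Return (bases, codons) for a simple 'start..end' feature location."""
    start_text, end_text = location.split("..")
    start, end = int(start_text), int(end_text)
    # Feature coordinates are 1-based and inclusive at both ends.
    bases = end - start + 1
    if bases % 3 != 0:
        raise ValueError("CDS length is not a whole number of codons")
    return bases, bases // 3


bases, codons = cds_length("138..1184")
print(bases, codons)
```

If the last codon is a stop codon (an assumption, not something the record states), the encoded protein would be one residue shorter than the codon count.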



Initial Score    = 1139   Optimized Score = 1522   Significance = 0.00
Residue Identity =  74%   Matches         = 1592   Mismatches   =  436
Gaps             =  100   Conservative Substitutions            =    0
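
The residue identity printed with each alignment appears consistent with the number of matches divided by all aligned columns (matches, mismatches and gaps), truncated to a whole percent. That relationship is inferred from the printed figures, not taken from FastDB documentation, and the function name below is hypothetical.

```python
def residue_identity(matches: int, mismatches: int, gaps: int) -> int:
    """Percent identity over all aligned columns, truncated to an integer."""
    columns = matches + mismatches + gaps
    return (100 * matches) // columns


# Figures printed for the MUSFISP12B alignment: 1592 matches,
# 436 mismatches, 100 gaps.
print(residue_identity(1592, 436, 100))
```

The same arithmetic reproduces the 20% figure reported for the protein alignments later in the printout (for example 75 matches, 266 mismatches, 20 gaps).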



X lO 20 

CCCGGCCGACAGCCCCGAGACG- 



30 40 SO 60 

•ACAGCCCGGCGCGTCCCGGTCCCCACCTCCGACCACCGCCAGCGC 



• i i i i i * 
i i i i * ■ i 



• •■it 

• i i i i 



GAATTCCGCCGACAACCCC— AGACGCCACCGCCTGGAGCGTCCAGACACCAACCTCCG— CCCCTGTCCG— AA 
X 10 20 30 40 50 60 



70 80 90 100 

TCCAGGCCCCGCGCTCCCCGCTCGCCGCCACCGCGCCCTCC* 



1 10 

-GC-TCCGC- 



120 130 
CCGCAGTGCCAACCATG 



t ■ ■ i i t i 
i i i i i i t 



■ i 



> t i 
• t i 



■ ■ i 

■ i t 



■ i i i 
i i i i 



■ i 



i * 



TCCAGGCTCCAGCCGCGCCTCTCGTCGCCTCTGCACCCTGCTGTGCATCCTCCTACCGC— GTCCCGATCATG 
70 80 90 100 110 120 130 140 



140 ISO 160 170 180 190 200 

ACCGCCGCCAGTATGGGCCCCGTCCGCGTCGCCTTCGTGGTCCTCCTCGCCCTCTGCAGCCGGCCGGCCGTC 



tiii 
i i i i 



■ * 
• i 



i i i 
t t * 



■ ■ i 

• t t 



i t 
i i 



■ i 
t i 



CTCGCCTCCGTCGCAGGTCCCATCAGCCTCGCCT TGGTGCTCCTCGCCCTCTGCACCCGJ 

1 50 1 60 1 70 1 80 1 90 200 C6Ct^G 



ACG 



210 220 230 2Sp S P 250 

GGCCAGAACTGCAGCGGGCCGTGCCGGTGCCCGGACGAGCCGGCGCC 
■ * i t i i t » t • i i » i i ii it t tiii i ■ iii i 
» i > * » • t i i » i . t i i i ii »i i i i t t t 




26Q5 £ 270 .S 
GCCCGGCGGGCGTGAGCCTC 







rseuaoraDitJb vn ub lurio. d ge 




1 DO 




(. 


(\j 


<j 


 9. HUMCANPRA    Human calcium-dependent prote     1154    164    390   7.46     0
10. TTHCAAA      T. thermophilus cytochrome caa    1235    164    286   7.46     0
11. XELRGEE83    X. laevis rRNA external transc     865    162    210   7.34     0
12. XELRGMN3     X. laevis 28S rRNA non-transcr    1151    162    243   7.34     0
13. HS5HCMVCG    Human Cytomegalovirus Strain    229354    158    852   7.10     0

                 **** 6 standard deviations above mean ****

14. HUMRET5      Human mRNA for ret proto-onco      989    154    431   6.86     0
15. SERERYFGH    S. erythraea 6-deoxyerythronol    2243    152    394   6.74     0
16. MUSHTF9      Mouse DNA for GC rich region      3725    151    869   6.69     0
17. HVBLE        Barley (H. vulgare) lectin mRN     972    150    405   6.63     0
18. BLYLEC       Barley root-specific lectin m      972    150    405   6.63     0
19. WHTAGGTA     Wheat (T. aestivum) germ agglu     998    149    417   6.57     0
20. HUMVIM       Human vimentin gene, complete     1749    148    717   6.51     0



Query sequence being compared: US-07-752-427-1 (1-2075)
Number of sequences optimized:

Results of the optimized comparison of US-07-752-427-1 (1-2075) with:
   Data bank : GenBank 69, all entries
   Data bank : UEMBL 28 69, all entries



PARAMETERS

Similarity matrix      Unitary     K-tuple               4
Mismatch penalty             1     Joining penalty      30
Gap penalty               1.00     Window size          11
Gap size penalty          0.33
Cutoff score               124
Randomization group          0



Initial scores to save 
Opt i m i zed scores to save 



20 



Alignments to save          10
Display context             10



SEARCH STATISTICS 



Scores = 



Mean 
820 



Med i an 
821 



Standard Deviation 
O. OO 



T i mes « 



CPU 
00 * 00 s OS. 



Tota 1 El apsed 
OOsOO-" 16. OO 



Number of residues* 

Number of sequences optimized a 



45854 1 



The scores below are sorted by optimized score. 
Significance is calculated based on optimized score. 

A 100% identical sequence to the query sequence was not found. 



The list of best scores is: 



                                                          Init.   Opt.
    Sequence Name  Description                    Length  Score  Score   Sig. Frame

 1. MUSFISP12B     Mouse FISP-12 protein (fisp-1    2267   1139   1522   0.00     0
 2. MUSFISP12A     Mouse FISP-12 protein (fisp-1    4128    498    954   0.00     0
 3. HUMFGFAA       Human fibroblast growth facto    3901    146    878   0.00     0
 4. HUMERP         Human erythropoietin gene, co    3398    128    877   0.00     0
 5. MUSHTF9        Mouse DNA for GC rich region     3725    151    869   0.00     0
 6. HUMSRF         Human serum response factor (    42qA    134    868   0.00     0
 7. MUSERPA        Mouse erythropoietin gene, co    389^    139    867   0.00     0
 8. CHKCEF         Chicken CEF-10 protein mRNA,     1805    326    865   0.00     0





[Histogram residue: horizontal axis SCORE from 0 to 1139 (surviving ticks at 127, 253, 380, 506, 759 and 1012) with a STDEV scale starting at -1; the plotted points did not survive the scan.]



PARAMETERS

Similarity matrix      Unitary     K-tuple               4
Mismatch penalty             1     Joining penalty      30
Gap penalty               1.00     Window size          11
Gap size penalty          0.33
Cutoff score               124
Randomization group          0

Initial scores to save      20     Alignments to save   10
Optimized scores to save    20     Display context      10



SEARCH STATISTICS 



Scores = 



Mean 



Med i an 



Standard Deviation 
16. 75 



T i mes s 



CPU 

Ol :04U3. 07 



Total Elapsed 

0 1 fi 1 5 s 09. OO 



Number of res i dues < 

Number of sequences searched * 

Number of scores above cutoff 



74447 1 52 
57272 



The scores below are sorted by initial score. 

Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was not found. 



The list of best scores is:

                                                          Init.   Opt.
    Sequence Name  Description                    Length  Score  Score   Sig. Frame

                   **** 65 standard deviations above mean ****
 1. MUSFISP12B     Mouse FISP-12 protein (fisp-1    2267   1139   1522  65.66     0
                   **** 27 standard deviations above mean ****
 2. MUSFISP12A     Mouse FISP-12 protein (fisp-1    4128    498    954  27.40     0
                   **** 22 standard deviations above mean ****
 3. MUSCYR61A      Mouse Cyr61 mRNA, complete cd    2018    408    848  22.02     0
                   **** 17 standard deviations above mean ****
 4. CHKCEF         Chicken CEF-10 protein mRNA,     1805    326    865  17.13     0
                   **** 13 standard deviations above mean ****
 5. MMCYR61G       Mouse growth factor inducible    5196    264    833  13.43     0
                   ****  8 standard deviations above mean ****
 6. HUMNFIL6       Human gene for nuclear factor    1910    181    621   8.48     0
                   ****  7 standard deviations above mean ****
 7. PSEPMI         P. aeruginosa pmi gene encodin   1990    170    718   7.82     0
O 



> □ < 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file us-07-752-427-1.res made by maryh on Thu 6 Feb 92 16:25:14-PST.

Query sequence being compared:  US-07-752-427-1 (1-2075)
Number of sequences searched:   57272
Number of scores above cutoff:  69

Results of the initial comparison of US-07-752-427-1 (1-2075) with:
   Data bank : GenBank 69, all entries
   Data bank : UEMBL 28 69, all entries


[Histogram of initial scores: NUMBER OF SEQUENCES on a logarithmic vertical axis (ticks from 50 to 100000); the plotted points did not survive the scan.]

140 150 160 170 ISO 130 
LCSMDVRLPSPD CPFPRRVI- GKCCEEWVCDEPK OQT VVGPP.I _AAY -LEDTFGPDPTM I RA NC 

it i ill ii ill i iii i 

it • iii rt »ii i iii i 

LGMTDYLV I VEDDDSA 1 1 PCRTTDP ETPV TLHNSEG V VP AS Y DSRGGFNGTF TVGPYI CEATVKGK 

1 40 1 50 1 60 1 70 1 SO 1 30 

200 2 1 0 220 230 240 250 260 

LVQT — TEWSACSKTCGMG I STRVTNDNASCRLEK9SRLCMVRPCEADLEEN I KKGKKC I RTPK I SKP I KFE 

■ i it « i i i t i > • ■ 

ii it i • i i i i i * • 

KFQT I PFNVYALK AT SELDLEMEALKTVYK-SGET I WTCAVFNNE WDLQWT YPGE VKGKG I TMLE 

200 2 1 0 220 230 240 250 260 

270 280 290 300 3 1 0 320 330 

LSGCTSMK TYRAKFGGVCTDGRCCTPHRTTTLPV -EFKCPDGEVMKKNMMF I K TCACH YNCPGDND I FE 

iii ii i • iii ii > » 

tit ii i i iii ii ii 

E I KVPS I KLVYTLTVPEATVKDSGDYECAARGATREVKEMKK VT I SVHEKGF I E I KPTFSQLEAVNLHEVKH 
270 280 230 300 310 320 330 

340 X 
SLYYRKMYGDMA 

« 

FWEVRAYPPPR I SWLKNNLTL 
340 X 350 



IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file us-07-752-427-2-a-geneseq.res made by maryh on Thu 6 Feb 92 15:28:31-P

Query sequence being compared:  US-07-752-427-2 (1-349)
Number of sequences searched:   14140
Number of scores above cutoff:  3377

Results of the initial comparison of US-07-752-427-2 (1-349) with:
   Data bank : A-GeneSeq 5, all entries

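[Histogram of initial scores: NUMBER OF SEQUENCES on a logarithmic vertical axis (ticks from 10 to 10000); horizontal axis SCORE from 0 to 15 (surviving ticks at 8, 10, 13 and 15) with a STDEV scale starting at -1; the plotted points did not survive the scan.]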



PARAMETERS

Similarity matrix      Unitary     K-tuple               2
Mismatch penalty             1     Joining penalty      20
Gap penalty               1.00     Window size          11
Gap size penalty          0.05
Cutoff score                 6
Randomization group          0

Initial scores to save      20     Alignments to save   10
Optimized scores to save    20     Display context      10



SEARCH STATISTICS

Scores:     Mean     Median     Standard Deviation
               4                       2.04

Times:      CPU              Total Elapsed
            00:00:42.09      00:01:27.00

Number of residues:             2168208
Number of sequences searched:   14140
Number of scores above cutoff:  3377



The scores below are sorted by initial score. 
Significance is calculated based on initial score.

A 100% identical sequence to the query sequence was not found. 
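
The significance column appears to be the number of standard deviations by which a score exceeds the mean of all scores in the search. The sketch below illustrates that relationship with the mean (4) and standard deviation (2.04) printed under SEARCH STATISTICS for this search. The function name is hypothetical, and the printed mean is presumably rounded, so the last digit may differ slightly for other searches in this printout.

```python
def significance(score: float, mean: float, std_dev: float) -> float:
    """Standard deviations above the mean (a z-score)."""
    return (score - mean) / std_dev


# Initial scores that occur in the list of best scores for this search.
for initial_score in (15, 14, 13, 12):
    print(initial_score, round(significance(initial_score, 4, 2.04), 2))
```

Scores of 15, 14, 13 and 12 fall in the "5", "4", "4" and "3 standard deviations above mean" bands respectively, which matches the grouping used in the list.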
The list of best scores is:

                                                         Init.   Opt.
    Sequence Name  Description                   Length  Score  Score   Sig. Frame

                   **** 5 standard deviations above mean ****
 1. P93284         Sequence of clone HIV-2 SBL/I   3025     15     57   5.39     0
                   **** 4 standard deviations above mean ****
 2. PS1243         Human spleen trypsin III (try    247     14     37   4.90     0
 3. R11741         Granulocyte colony stimulatin    783     14     32   4.90     0
 4. R11742         Clone 25-1 encoded human G-CS    801     14     35   4.90     0
 5. R08039         Cathepsin D fragment of fibro    993     14     45   4.90     0
 6. P70373         Human fibronectin gene produc   2327     14     45   4.90     0
 7. R12185         Protease inhibitor with varia    114     13     17   4.41     0
 8. P94681         Amino acid sequence encoded b    348     13     25   4.41     0
                   **** 3 standard deviations above mean ****
 9. R10109         Trigramin-beta 1                  72     12     20   3.92     0
10. P91320         New trigramin peptide isolate     72     12     20   3.92     0
11. R10110         Trigramin-beta 2.                 73     12     20   3.92     0
12. R10106         Trigramin-gamma.                  73     12     19   3.92     0
13. P96395         Albolabrin.                       73     12     19   3.92     0
14. R06395         Albolabrin.                       73     12     19   3.92     0
15. R07329         Amb a I/Antigen E encoded by      92     12     20   3.92     0
16. R07330         Amb a I/Antigen E encoded by      94     12     21   3.92     0
17. R07455         N-terminal sequence of anti-p    120     12     22   3.92     0
18. P91901         Deduced sequence of porcine e    203     12     34   3.92     0
19. P91376         Porcine endothelin III.          203     12     34   3.92     0
20. P90502         Gp. B Eimeria tenella immunoge   237     12     36   3.92     0



Query sequence being compared: US-07-752-427-2 (1-349)

Results of the optimized comparison of US-07-752-427-2 (1-349) with:
   Data bank : A-GeneSeq 5, all entries

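[Histogram of optimized scores: NUMBER OF SEQUENCES on a logarithmic vertical axis (ticks from 10 to 1000); horizontal axis SCORE from 36 to 60 (surviving ticks at 42, 48, 51, 54 and 57) with a STDEV scale starting at -3; the plotted points did not survive the scan.]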



PARAMETERS

Similarity matrix      Unitary     K-tuple               2
Mismatch penalty             1     Joining penalty      20
Gap penalty               1.00     Window size          11
Gap size penalty          0.05
Cutoff score                 6
Randomization group          0

Initial scores to save      20     Alignments to save   10
Optimized scores to save    20     Display context      10



SEARCH STATISTICS

Scores:     Mean     Median     Standard Deviation
              45       49              3.08

Times:      CPU              Total Elapsed
            00:01:28.        00:02:56.00

Number of residues:             1361708
Number of sequences optimized:  3377




The scores below are sorted by optimized score. 



Significance is calculated based on optimized score.



The list of best scores is: 



                                                         Init.   Opt.
    Sequence Name  Description                   Length  Score  Score   Sig. Frame

                   **** 5 standard deviations above mean ****
 1. P92069         Human muscarinic acetylcholin   2135     10     63   5.84     0
                   **** 4 standard deviations above mean ****
 2. P60053         Sequence of von Willebrand fa   2813     11     59   4.54     0
 3. P60462         Sequence of human von Willebr   2813      7     59   4.54     0
                   **** 3 standard deviations above mean ****
 4. P93284         Sequence of clone HIV-2 SBL/I   3025     15     57   3.89     0
 5. P80674         NtrA gene product.               523      6     57   3.89     0
 6. R10534         Human 160kD mediator of infla   1427     10     57   3.89     0
 7. P93357         Sequence of the catalytic dom   1522      9     57   3.89     0
 8. R12108         N-terminal deleted adenyl cyc   1445      9     56   3.57     0
 9. R08267         Platelet derived growth facto   1083      6     56   3.57     0
10. R0SS10         Alpha type PDGF receptor dedu   1089      6     56   3.57     0
11. P94365         Sequence of part of adenylate   1705      9     56   3.57     0
12. R07683         gag protein precursor, p75gag    636      6     56   3.57     0
13. P80810         Sequence of pol protein of HI   1014      6     55   3.25     0
14. P90599         Human retinoblastoma.            970      6     55   3.25     0
15. R07713         Human low density lipoprotein    800      8     55   3.25     0
16. P60057         Factor IX/Factor VII fusion p    453      9     55   3.25     0
17. P90180         Tissue plasminogen activator     534      6     55   3.25     0
18. P60056         Factor VII peptide encoded by    466      9     55   3.25     0
19. R08031         Adenyl cyclase from Bordetell   1706      9     55   3.25     0
20. P60055         Partial Factor VII peptide.      371      9     55   3.25     0



1. US-07-752-427-2 (1-349)
   P92069  Human muscarinic acetylcholine m2 receptor gene.

ID   P92069 standard; protein; 2135 BP.
AC   P92069;
DT   27-SEP-1989 (first entry)
DE   Human muscarinic acetylcholine m2 receptor gene.
KW   Muscarinic acetylcholine receptor; drug screening; probes; m2; ss.
OS   Homo sapiens.
FH   Key             Location/Qualifiers
FT   CDS             120..1517
FT                   /*tag= a
FT                   /product=MAR subtype m2
FT   misc_signal     74
FT                   /*tag= b
FT                   /label=splice acceptor site
FT                   /note=defines the 5' end of the exon
FT   polyA_site      74..2096
FT                   /*tag= c
PN   US7241971-A.
PD   14-MAR-1989.
PF   08-SEP-1988; 241971.
PR   08-SEP-1988; US-241971.
PA   (USSH) Nat Inst of Health.
DR   WPI; 89-165452/22.
DR   P-PSDB; P96203.
PT   Cloned genes for muscarinic acetylcholine receptors -
PT   for drug screening and diagnostic use.
PS   Disclosure; p; English.
CC   The sequence may be useful for synthesis of hybridisation probes for
CC   diagnostic use. The genes are cloned by screening a rat cerebral cortex



L^UIXM 1 IDT di y W I in ci pT UDtJ UdSCU UM I I \\J\^ 1 fc?LJ l# I uco 1 f v ££«J t-JT Cnt3 TTUI l"UJa 1 i »y 

CC   strand of porcine brain (m1) cDNA (modified at positions 5, 38 and 53);
CC   identifying cDNA clones by hybridisation with BamHI or XhoI digests of
CC   plasmid DNA from 12 cultures contg. up to 500000 independent clones;
CC   reculturing until a pool contg. less than 5000 indep. clones with a
CC   single hybridisation band are identified, and isolating individual clones
CC   by colony hybridisation.
CC   Stable cell lines are produced by transfecting Chinese hamster ovary cells
CC   (CHO-K1) with various pCD vectors contg. the gene inserts.
CC   See also N92068-N92067 and N90086.
SQ   Sequence 2135 BP; 629 A; 462 C; 448 G; 596 T;

Initial Score    =  10   Optimized Score =  63   Significance = 5.84
Residue Identity = 20%   Matches         =  75   Mismatches   =  266
Gaps             =  20   Conservative Substitutions           =    0

X 10 20 30 40 SO 60 

MTAASMGPVRVAFVVLLALCSRPAVGQNCSGPCRCPDEPAPRCPAGVSLVLDGCGCCRVCAK 

• tti i it iii 

t lit t it iii 

AAAGTCAACCGCCACCTCCAGACCGTCAACAATTACTTTTTATTCAGCTTGGCCTG-TGCTGACCTTATCAT 

270 X 280 290 300 310 320 330 



70 80 90 lOO HO 120 130 

QLGELCTERDPCDPHKGLFCDFGSP ANRK I GVCT-^-AKDGAPC I FGGT V YRSGESFOSSCK YOCTCLDG — A 

it t till i i i i iii i t i 

ii i iiii tilt iii t i i 

AGGTGTTTTCTC-CATGAACTTG-TACACCCTCTACACTGTGATTGGTTACTGGCCTTTGGGPiCCTGTGGTG 
340 350 360 370 380 390 400 



140 ISO 160 170 180 190 200 

VGCMPLCSMD VRLPSPDCPFPRR VK LPGK CCEE W VCDEPK DOT WG — PALAAYRLEDTFGPDPTM I RANCL 

ii i i ■ it it i i t i 

t i i i i t i it t iii 

TGTGACCTTTGGCTAGCCCTGGACTATGTGSTCAGCAAT6CCTCAGTTATGAATCTGCTCATCAT CAGC- 

410 420 430 440 450 4B0 470 



210 220 230 240 250 2G0 270 

VQTTEWSACSKTCGMG I STRVTNDNASCRLEKQSRLCMVRPCEADL.EEN IKKG — KKC I RTPK ISKPI KFEL 

III! III III II 

I I t I It I III II 

-TTTGACAGGTACTTCTGT-GTCACAAAACCTCTGACCTACCCAGTCAAGCGGACCACAAAAATGGCAGGTA 
480 490 500 5 1 0 520 530 540 



280 290 300 3 1 0 320 330 

SGCTSMKTYRAKFCGVCT— DGRCCTPHRTTTLPVEFKCPDGEVMKKNMMF I KTC ACHYNCPGDND I FES 

t i it t ii t tit iii ii ii t it 

ii t i i it i tit tii it ii i ii 

TGATGATTGCAGCTGCCTGGGTCCTCTCTTTCATCCTCTGGGCTCCAGCCATTCTCTTCTGGCAGTTCATTG 
550 560 570 580 590 600 610 



340 X 

LYYRKMYGDMA 

• i 
i i 

TAGGGGTGAGAACTGTGGAGG 
620 X 630 



2. US-07-752-427-2 (1-349)
   P60053  Sequence of von Willebrand factor (vWF)

ID   P60053 standard; Protein; 2813 AA.
AC   P60053;
DT   22-JUL-1991 (first entry)
DE   Sequence of von Willebrand factor (vWF).
KW   Vascular injury; platelet plug formation.
OS   Homo sapiens.
PN   EP-197592-A.
PD   15-OCT-1986.
PF   26-MAR-1986; 200518.
PR   01-APR-1985; NL-000961.
PA   (FRIN-) STICHT FRINDEj^RANT.
PI   Pannekoek H, Verwey Diergaarde PJ, Hart MHL;
DR   WPI; 86-273504/42.
DR   N-PSDB; N60061.
PT   Recombinant cDNA plasmid or phage - contg. C-DNA fragment which
PT   codes for biological activity of human von Willebrand Factor
PS   Disclosure; Fig 3; 37pp; English.
CC   vWF (glyco) protein having the AA sequence corresponding to the
CC   nucleotide sequence of 2518-8667 or 295-2517 of N60061 is claimed.
CC   Also claimed are new microorganisms, animal cell or human cell contg.
CC   the recombinant cDNA plasmid or phage; e.g. strain E. coli DH1 contg.
CC   the recombinant cDNA plasmid pSP8800vWF is deposited as C. B. S. No
CC   163.86.
SQ   Sequence 2813 AA;

so isoa; 137R; ioin; 160D; o b; 217c; 142g; isie; o z; 207G; 77 h; 

SO 97 i; 233L; 104K; 55 m; 93 F; 173P; 201S; 146T; 27 w; 81 y; 221V; 

Initial Score    =  11   Optimized Score =  59   Significance = 4.54
Residue Identity = 20%   Matches         =  77   Mismatches   =  248
Gaps             =  43   Conservative Substitutions           =    0



X lO 20 30 40 SO 

MTAASMGPVRV-AFVVLLALCSRPAVGQ-NCSGPC-RCPDEPAPRCPAGVSLVLDGCGCCRV 

t ■ t I I till! • • * 

I til I I III I t I I 

NYNGNQGDDFLTFSGL.AEPRVEDFGNAWKLHGDCODLQKOHSDPCALNPRMTRFSEEACAVLTSPTFEACHR 
530 X 540 550 5G0 570 580 590 



60 70 80 90 100 110 120 130 

CAKOLGELCTER-DPCDPHKGLFCDFQSPANRKIGVCTAKDGAPCIFGGTVYRSQESFOSSCKYOCTCLDGA 

■ i i i i iiit i - i i iii 

■ * i t i iiit it t iii 

AVSPLPYLRNCRYDVCSCSDGRECLCGALA SYAAACAGRGVRVAWREPGRCELNCPKGQ-VYLGCG 

600 610 620 630 640 650 660 

140 ISO 1GO 170 180 190 

VGCMPLCSMDVRLPSPD CPFPRRVKLPGKCCEEWVCDEPKDQTWGPALAAYRLEDTFGPDP 

II lit II it 1 ill ill 

It III II II * lit til 

TPCNLTC RSLS YPDEECNEACLEGCFCP PGL YMDERGDC VPK AQCPC Y YDGE I FQPED I FSDHH 

670 680 690 700 710 720 



200 21 0 220 230 240 250 260 

TM I RANCLVQTTEWSACSKTCGMG I STRVTNDNASCRLEKOS RLCMV— RPCEADLEEN IKKGKKC I RTP 

, t it t l i t l • t i i i i tit 

, , it i it II i tt tit til 

TM CYCEDGFMHCTMSGVPGSLLPDAVLSSPLSHR-SKRSLSCRPPMVKLVCPAD — NLR AEGLECTK T— 

730 740 750 760 770 780 790 

270 280 290 300 310 320 330 

KISKPIKFELSGCTSMKTYRAKFCGVCTDGRCCTPHRTTTLPVEFKCPDGEVMKKNMMFIKTCACHYNC-PG 

lit i I t I til ii 

tti ttit i t i it 

CQNYDLECMSMGCVS — GCLCPPGMVRHENRCVALERCPCFHQGKEYAPGETVK I GCNTCVCRDRKWNCTDH 
800 8 1 0 820 830 840 850 8GO 



340 X 

DND I FESL Y YRKM YGDM A 

• i i 

t i t 

VCD ATCST I GMAH YLTFDGLK YLFPGEC 
870 880 



3. US-07-752-427-2 (1-349)
   P60462  Sequence of human von Willebrand Factor (VWF) prec

ID   P60462 standard; Protein; 2813 AA.
AC   P60462;
DT   25-JUN-1991 (first entry)
DE   Sequence of human von Willebrand Factor (VWF) precursor.
KW   Chronic renal failure; therapy; factor VIII
OS   Homo sapiens.
PN   WO8606096-A.
PD   23-OCT-1986.
PF   10-APR-1986; U00760.
PR   11-APR-1985; US-722108.
PA   (CHIL-) CHILDRENS MED CENT.
PA   (GINS/) GINSBURG D.
PI   Ginsburg D, Orkin SH, Kaufman RJ;
DR   WPI; 86-291663/44.
DR   N-PSDB; N60404.
PT   Pure Von Willebrand Factor - produced using an expression vector
PT   including a DNA sequence encoding functional VWF protein
PS   Disclosure; Table 2, Pages 18-36A; 54pp; English.
CC   cDNA clones pVWH33, pVWH5 and pVWE6 which span 9 kb pairs of DNA and
CC   encompass the entire protein coding region of VWF, were selected to
CC   construct full length cDNA (N60404). The pure VWF produced is useful
CC   in the treatment of von Willebrand's disease (VWD) and the patients
CC   with chronic renal failure whose abnormal bleeding times are
CC   corrected by crude cryoprecipitate. Pure VWF can also be used to
CC   carry, stabilise and improve the therapeutic efficacy of factor
CC   VIII:C.
SQ   Sequence 2813 AA;
SQ   154A; 143R; 98 N; 155D; 0 B; 234C; 133Q; 181E; 0 Z; 205G; 70 H;
SQ   95 I; 227L; 108K; 56 M; 89 F; 176P; 207S; 151T; 26 W; 79 Y; 226V;

Initial Score    =   7   Optimized Score =  59   Significance = 4.54
Residue Identity = 20%   Matches         =  76   Mismatches   =  259
Gaps             =  28   Conservative Substitutions           =    0

X lO 20 30 40 SO 60 

MTAASMG— PVRVAFVVLLALCSRPAV— GQNCSGPCRCPDEPAPRCPAGVSLVLDGCGCCRVC 

• i i i > t i> iiii lit 

i t i tti it iiii iti 

CLPDK VCVHRST I YPVGQFWEEGCDVCTCTDMEDAVMGLRV AGCSGKPCE — DSCRSGFTYVLHEGECCGRC 

2430 X 2440 2450 2460 2470 2480 2490 

70 80 90 lOO 110 120 

AKQLGELCT ERDPCDPHKGLFCDFGSPAN-RK I GVCTAKDGAFC I FGGTV-YRSGESFQSSCK YOCTCL 

ti i i iiiii ii t ii 

II I f lllll II I It 

LPSACEWTGSPRGDSQSSWKS VGSQWASPENPCL I NECVRVKEEVF I GQRNVSCPQLEVPVCPSGFGLSCK 
2500 2510 2520 2530 2540 2550 2560 

130 140 150 160 170 180 190 
DGAVGCMPLCSMDVRLPSPDCPFPRRVKLPGKCCEEWVCDEPKDQTWGPALAAYRLE DTFGPDPTM I R 

iiii t iiii it it ti ill 

ttii I titt ti ii it tii 

TSA — CCPSC RCERMEACMLNGTV I GPGKTVM I DVCTTCRCMVQVG-V I SGFKLECRKTTCNPCP — LG 

2570 2580 2530 2600 26 1 O 2620 2630 

200 210 220 230 240 250 260 

ANCL VGTTE WS ACSK TCGMG I STRVTNDNASCRLEKOSRLCMVRPCEADLEEN I KKGKKC I RTPK ISKPIKF 

t i ii i i t t i t i 

it ii it t t i i i 

YKEENNTGECCGRCLPT ACT I QLRGGQ I MTLKRDETLQDGCDTHFC-K VNERGEYFWEKRVTGCPPFDEHKC 

2640 2650 2660 2670 2680 2690 2700 

270 280 290 300 310 320 330 

ELSGCTSMKT YRAKFCGVCTDGRCCTPHRTTTL-PVEFKCPDGEVMKKNMMF I KTCACHYNCPGD— ND — I F 

tit it t tti »t ii iii 

• it ti t iti it ii iii 

LAEGGK I MK- I PGTCCDTCEEPEC ND I TARLGYVK VGSCKSEVEVD I H YCQGK CASK AM YS I D I NDVODO 

2710 2720 2730 2740 2750 2760 2770 

340 X 

ESLYYRKMYGDMA 

• i 

i i 

CSCCSPTRTEPM9VALHCTNGSV|^ 

2780 X 2790 




4. US-07-752-427-2 (1-349)
   P93284  Sequence of clone HIV-2 SBL/ISY.

ID   P93284 standard; protein; 3025 AA.
AC   P93284;
DT   06-APR-1990 (first entry)
DE   Sequence of clone HIV-2 SBL/ISY.
KW   HIV-2; proviral clone HIV-2 SBL/ISY;
OS   Human immunodeficiency virus 2.
PN   US7331212-A.
PD   29-AUG-1989.
PF   31-MAR-1989; 331212.
PR   31-MAR-1989; US-331212.
PA   (USSH) US Dept. Health and Human Services.
PI   Franchini G, Wong-staal F, Gallo R;
DR   WPI; 89-339698/46.
DR   N-PSDB; N92119.
PT   Complete human immunodeficiency type 2 proviral clone - used to generate
PT   animal model for function studies of HIV genes in vivo.
PS   Disclosure; Fig. 5; 43pp; English.
CC   The protein is encoded by the second reading frame of HIV-2 SBL/ISY, a
CC   proviral clone of HIV-2.
SQ   Sequence 3025 AA;
SQ   143 A; 278 R; 126 N; 104 D; 0 B; 96 C; 191 Q; 115 E; 0 Z; 244 G;
SQ   95 H; 153 I; 224 L; 246 K; 65 M; 63 F; 160 P; 291 S; 193 T; 53 W;
SQ   80 Y; 105 V;

Initial Score    =  15   Optimized Score =  57   Significance = 3.
Residue Identity = 20%   Matches         =  75   Mismatches   =  256
Gaps             =  41   Conservative Substitutions           =    0



X 10 20 30 40 

MTAASMGPVRVAFVVLLALCSRPAVGGNCSGPCRCPDEPA- 



SO 

■PRCPAGVSLVLDGCGC 



MDSHPETSGCPKLGGTNLPRNKDOTLMA — NRKDDTHGRST VDR I SRSRAGGKQNYLKPGTGGTLLPRGKRV 
980 990 1000 1010 1020 1030 1040 

60 70 80 90 1 OO 110 1 20 

CRVCAK QLGELCTERDPCDPHKGLFCDFGSPANRK I GVCT AKDGAPC I FGGTV YRSGESFQSSCK YOCT 



RSNSSKGSROSVD I STPGRKNSKSRK I CKDKK YPYORGOVVGT GSSENRKRSTSHLGTNTK I SPTSRKR 

1 050 1 060 1 070 1 080 1 OSO 1 1 00 1110 



130 140 150 160 170 180 190 

CLDGAVGCMPLCS— MDVRLPSPDCPFPRRVKLPGKCCEEWVCDEPKDQTVVGPALAA YRLEDTFGPD 

t t i i i ii it i iiit i iii i 

• • • i i i i t i i i iii i iii i 

DLGTVVG — LLASDMDPRLGLR I HPTVGOVS I PGKRSYTRRRD LLHGW I L8A I KRGKSR I YNRRKRQ 

1120 1130 1140 1150 1160 1170 1180 

200 210 220 230 240 250 

PTM I RANCLVQTTEWSACSKTCGMG I STRVTNDNASCRL EK QSRLCM VRPCEADLEEN I KKG-KKC I R 

• <*'•*• t > tit i i i i ■ i 

i I t I i i i i I ill l i iii i 

SKD I RAN — YQPT — SR I RSLCNGSNRLRSKSQYCS— RLTVCNGNSNRPTG I RENSKN YRRDDKKGSNLCCM 

1190 1200 1210 1220 1230 1240 

260 270 280 290 300 310 320 

TP-K I SKP I KFELSGCTS— MKTYRAKFCGVCTDGR— CCTPHRTTTLPVEFKCPDGEVMKKNMMF I KTCACHY 
■ * * » •! i*i ii ii i i 

GPGPORHRRKSRNPLSKSGHOTS I I PRENRARSGRTWK I SGCKRTSPVW I TOPGGKTNSKHMCPVPTKRGSY 

1250 1260 1270 1280 1290 1300 1310 



330 340 X 

NCPGDND I FESLYYRKMYGDM A 

i i 
i ■ 

-TWASKCRTRHLANGLHTLRMKNHYSSSTCCK 



1320 1330 1340 



5. US-07-752-427-2 (1-349) 

P80674 NtrA gene product. 

ID P80674 standard; protein; 523 AA. 

AC P80674; 

DT 24-OCT-1990 (first entry) 

DE NtrA gene product. 

KW NtrA; sigma factor; transcriptional activation; nitrogen assimilation; 

KW fixation; nif operon; C4-dicarboxylate transport. 

OS Rhizobium meliloti. 

PN EP-292984-A. 

PD 30-NOV-1988. 

PF 27-MAY-1988; 108482. 

PR 29-MAY-1987; US-055228. 

PA (GEHO-) Gen Hospital Corp. 
PI Ronson C, Ausubel F; 

DR WPI; 88-339561/48. 

DR N-PSDB; N81266. 

PT Cloned rhizobium meliloti ntrA gene - for use in C4-dicarboxylate 

PT transport, nitrate assimilation, symbiotic nitrogen fixation and 

PT identifying pathogenic genes. 

PS Claim 1; Page 12; 19pp; English. 

CC The ntrA gene product is a transcriptional activator. It controls the 

CC processes of nitrate assimilation, symbiotic nitrogen fixation and 

CC C4-dicarboxylate transport, working in conjunction with NtrC, NifA and 

CC DctD resp. It can also be used to identify genes involved in 

CC pathogenesis. 

CC See also P82126. 

SQ Sequence 523 AA; 

SQ 60 A; 43 R; 15 N; 43 D; 0 B; 4 C; 25 Q; 39 E; 0 Z; 29 G; 12 H; 

SQ 23 I; 58 L; 15 K; 13 M; 17 F; 25 P; 37 S; 21 T; 3 W; 8 Y; 33 V; 

Initial Score = 6 Optimized Score = 57 Significance = 3.89 

Residue Identity = 20% Matches = 75 Mismatches = 257 

Gaps = 39 Conservative Substitutions = 0 

X 10 20 30 40 50 

MTAASMGPVRVAFWLLALCSRPA — VGQNCSGPCRCPDEP — APRCPAGVSLVLDGCGCCR 

tii 11 111 iii iti if 

111 1* 111 111 111 tt 

LHLRQSBSLVMTPQLMQS I QLLQMNHLELSHF I AQEVEKNPLLEVQPADEPT I SDRED AGPHP AETG-GETD 

10 X 20 30 40 50 60 70 

60 70 80 90 100 110 120 

VCAKQ — LGELCTERDPCDPHKGLFCDFGSPANRK I GVCTAKOGAPC I FGGTVYRSGESFQSSCK YQCTCLD 

iti i iitiii iii i t i 

iii i § i i i i i iii t i i 

EAAGQSDLYDSAMSRSGERLSEGLDADF AN— VFPDDTAPQRADAPELLGQWKSMPGAGDAEGY — DLDD 

80 90 100 110 120 130 140 

130 140 150 160 170 180 190 

GAVGCMPL— CSMDVRLP SPDCPFPRRVKLFGKCCEEWVCDEPKDGTVVGPA LAAYRLEDTFGPDP 

ii ii it i it ii ii 

ii I* ii i ftii ii 

F VGGRKTLRETLAEQLPF ALS AVSDRL I AR YF I DQLDD AGYLH ADL AET AETLG A AGEDV ARVLHVLQOFDP 

150 160 170 180 190 200 210 

200 210 220 230 240 250 260 

TM I R ANCLVGTTEWS A CSKTCGMG I STRVTNDNASCRLEK9SRLCMVRPCEADLEEN I KKGKKC I RTP 

• iii ii ii iii ii 

i i i i ii ii iii ii 

PGVFARTL GECLA I OLRARNRLDPAMEAL VANLELLARRDF ASLKK I CGVDEEDL I DMLAE I RKLDPKP 

220 230 240 250 260 270 280 



270 280 290 300 310 320 

K I S-KP I KFEUSGCTSMKT YRAI^?GVCTD5RCCTPHRTTTL.PVEFKCPD^^iEVMKKNMMF I KTC ACHYNC 



m ! 5rc I mvrt Hi lruVVVKH HrUlalaWUVtUNrUHL^VL-VrNMUYr- i t:li)K^i>KNrN^Lat:UHI-LrNtU 

290 300 _ 310 320 330 340 



330 340 X 

-PGDND I FESLYYRKMYGDMA 
• tii 





LQN ANWLTRSLDQR ART I MKVASE I VRGGD A 
350 360 370 380 



6. US-07-752-427-2 (1-349) 

R10534 Human 160kD mediator of inflammation protein. 
ID R10534 standard; Protein; 1427 AA. 
AC R10534; 

DT 12-APR-1991 (first entry) 

DE Human 160kD mediator of inflammation protein. 

KW Mediator of inflammation; cytokine; Hodgkin's lymphoma; MRP-160. 

OS Homo sapiens. 

PN EP-412050-A. 

PD 06-FEB-1991. 

PF 26-JUN-1990; 810481. 

PR 05-JUL-1989; GB-015414. 

PA (CIBA ) CIBA GEIGY AG. 

PI Odink KG, Tarcsay L, Bruggen J, Wiesendanger W, Cerletti N; 

PI Sorg C, Dewolf-Peeters C, Delabie J; 

DR WPI; 91-038913/06. 

DR N-PSDB; Q10378. 

PT 160 kD human polypeptide mediator or precursor of inflammation - 

PT polyclonal or monoclonal antibodies to polypeptide treat and 

PT diagnose chronic inflammation and hodgkins lymphoma 

PS Claim 3; Page 32; 47pp; English. 

CC The protein is a cytokine used to treat chronic inflammatory 

CC conditions. It is prepared by chromatographically purifying an 

CC optionally pre-purified cell extract, cell supernatant or cell 

CC filtrate of stimulated normal human leucocytes or human embryonic 

CC epithelial lung cells. Alternatively, the protein can be produced 

CC by microorganisms or continuous mammalian cell lines, transformed 

CC with plasmids encoding MRP-160. The invention also covers the 

CC polypeptide fragment from amino acids 878-1427 and derivatives of 

CC the protein in which the amino and/or hydroxyl functions are 

CC glycosylated or acylated and have mol. wts. of 190 and 140kD, 

CC respectively. 

SQ Sequence 1427 AA; 

SQ i ISA; 67 R; 59 N; 70 D; 0 B; 14 C; 90 Q; 193E; 0 Z; 49 G; 26 H; 

SQ 45 I; 156L; 153K; 33 M; 29 F; 34 P; 121S; 88 T; 5 W; 9 Y; 65 V; 



Initial Score = 10 Optimized Score = 57 Significance = 3.89 

Residue Identity = 19% Matches = 73 Mismatches = 258 

Gaps = 39 Conservative Substitutions = 0 

X 10 20 30 40 50 60 



MTAASMGPVRVAFWLLALCSRPAVGQNCSGPCRCPDE — PAPRCPAGVSLVLDGCGCCRVC 

> i tit tiit iii 

• i iii i i i i t t i 

SNLTKTASESISNLSEAGSIKKGERELKIGDRVLVGGTKAGVVRFLGETDFAKGEWCGVELDEPLGKNDGAV 

190 X 200 210 220 230 240 250 



70 80 90 100 110 120 

AKQLGELC TERDPCDPHKGLFCDFGS PANRK I GVCTAKDGAPC I FGGTV YRS— GESFQSSCK YQCTCL 

• * it i i I i i i ii t it i 

1 1 ' » I I I I I I II I II I 

AGTRYFQCQPK YGLFAPVHKVTK I GFPSTTPAKAK ANAVRRVMATTSASLKRSPSASSLSSMSSVASCV 

260 270 280 290 300 310 320 



130 140 150 160 170 180 190 

DGAVGCMPL CSMDVRLPSPDCPFPRRVKLPGKCCEEWVCDEPKDQTVVGPALAAYRLEDTFGPDPTMI 



ooKr'oK I LaLL 1 1 1 bbK Y HKf\ I bbi I ! RLftfcHLKt^bWH 1 bULLHtKULtKHt V^H- I oM Viae. I tWc.L-HL_HKiJ 

330 340 350 360 370 380 390 



200 210 220 230 240 250 260 

RANCLVGTTEWSACSKTCGMG I STRV-TNDNASCRLEKOSRLCMVRPCEADL EEN I KKGKKC I RTPK 

ii iii it titt itiii i 

ii iii ii ■ i i i § i i t i i 

GHDQHVLELE AKMDQL.RTMVEAADREK VELLNO— LEEEKRK VE— DLQFRVEEES I TKGDLETQTKL 

400 410 420 430 440 450 460 

270 280 290 300 310 320 330 

ISKPIKFELSGCTSMKTYRAKFCGVCTDGRCCT PHRTTTLPVEFKCPDGEVMKKNMMFIKTCACHYNCP 

it iii iii ii ii ii 

ii iii iti ii ii ii 

EH AR I KELEQSLLFEKTK ADKLGRELEDTRV AT VSEKSR I MELEKDLALRVGEV AELRRRLESNKP 

470 480 490 500 510 520 

340 X 
— GDND I FESL Y YRKM YGDM A 

iii ii 
lit il 

AGD VDMSLSLLGE I SSLGEKLEVTRTDHQR 
530 540 550 



7. US-07-752-427-2 (1-349) 

P93357 Sequence of the catalytic domain of Bordetella per 

ID P93357 standard; protein; 1522 AA. 

AC P93357; 

DT 22-MAR-1991 (first entry) 

DE Sequence of the catalytic domain of Bordetella pertussis 

DE adenylate cyclase (AC). 

KW Vaccine; ss. 

OS Bordetella pertussis. 

PN FR2618453-A. 

PD 27-JAN-1989. 

PF 24-JUL-1987. 

PR 24-JUL-1987; FR-010589. 

PA (INSP) Inst Pasteur. 

PI Danchin A, Glaser P, Ullmann A. 

DR WPI; 89-079098/11. 

DR N-PSDB; N90859. 

PT Cloning and expressing genes in multi-protein function system - by 

PT transforming host cell with indicator gene producing protein which 

PT interacts with expression prod, of the gene being cloned. 

PS Claim 14; Fig 2(a-f); 24pp; French. 

CC Sequences encoding at least a part of AC able to hybridise with a 

CC gene expressing a protein with AC activity, and proteins with AC 

CC activity able to form immune complexes with specific antibodies, are 

CC claimed. The isolated AC gene can be used to produce Bordetella 

CC pertussis toxin. 

SQ Sequence 1522 AA; 

SQ 206A; 95 R; 55 N; 121D; 0 B; 8 C; 66 Q; 75 E; 0 Z; 190G; 32 H; 

SQ 61 I; 123L; 41 K; 24 M; 36 F; 40 P; 93 S; 86 T; 13 W; 36 Y; 121V; 



Initial Score = 9 Optimized Score = 57 Significance = 3. 

Residue Identity = 19% Matches = 71 Mismatches = 258 

Gaps = 35 Conservative Substitutions = 0 

X 10 20 30 40 50 60 

MTAASMGPVRVAFVVLLALCSRPAVGQNCSGPCRCPDEPAPRCPAGVSLVLDGCGCCRVCAK 

• lit I II I lit t 

I III I II I III I 

KRTEN VQYRHVEL ARVG9V- VE VDTLEH VBH I I GGAGNDS I TGNAHD NFL AGGSGDDRLDGG AGNDTL 

1020 1030 1040 1050 1060 1070 1080 



70 80 90 100 110 120 130 

QLGELCTERDPCDPHKSLFCDFG^-SP ANRK I GVCT AKDGAFC I FGGTV^SGESFQSSCK YOCTCLDG AV 



1090 1100 1110 1120 1130 1140 1150 




140 150 160 170 180 190 
GCMP LCSMDVRLPSPDCPFPR RVKLPGKCCEEWVCDEPKDQTWG PALAAYRLEDT-FGPDPTM 



EPWP ALNLFSVDHVKN I ENLHGSRLNDR I AGDDQDNELWGHDGNDT I RGRGGDD I LRGGLGLDTLYGEDGND 
1160 1170 1180 1190 1200 1210 1220 

200 210 220 230 240 250 260 

I RANCLVGTTEWSACSKTCGMG I STRVTNDNASCRLEKQSRLCMVRPCEADLEEN I KKGKKC I RTPK ISKPI 

i i * it ti till 

i II II II lilt 

I FLQDDETVSDD I DGGAGLHT— VDYSAM I HPGR I VAPHEYGFG I EADLSREWVRK ASALGVDYYDNVR 

1230 1240 1250 1260 1270 1280 

270 280 290 300 310 320 330 

KFELSGCTSMK TYRAKFCGVCTDGRCCTPHRTTTLPVEFKCPDGEVMKKNMMF I KTCACHYNCPGDND I 

t iiii i ii i iiti * i i t i 

i iiii t ii i i ii i t iiii 

NVENV I GTSMKDVL I GDAQ ANTLMGQGGDDTGAAATANDLLF-GGDGNDMLY6DAGNDT LYGGLGD— DT 

1290 1300 1310 1320 1330 1340 1350 

340 X 
FESLYYRKMYGDMA 

t i 
t t 

LEGGAGNDWFGQTQAREHDVLRGG 
1360 1370 1380 



8. US-07-752-427-2 (1-349) 

R12108 N-terminal deleted adenyl cyclase from Bacillus sp 

ID R12108 standard; protein; 1445 AA. 

AC R12108; 

DT 30-JUL-1991 (first entry) 

DE N-terminal deleted adenyl cyclase from Bacillus sp. 

KW cyclic adenosine monophosphate; cAMP; NDPK. 

OS Bacillus pertussis. 

PN WO9106664-A. 

PD 16-MAY-1991. 

PF 31-OCT-1990; F00793. 

PR 31-OCT-1989; FR-014328. 

PR 12-APR-1990; FR-004754. 

PA (INSP) INST PASTEUR. 

PI Lacombe ML, Veron M, Mock M, Barzu Sarfati R; 

DR WPI; 91-164211/22. 

DR N-PSDB; Q11826. 

PT Partly new substd. adenosine di- and tri-phosphate derivs. prodn. 

PT - useful as laboratory reagents e.g. affinity chromatography 

PT ligands or fluorescence markers 

PS Claim 11; Fig 1; 54pp; French. 

CC This protein is an N-terminal deletion of the protein encoded by 

CC B. pertussis "cya" gene. The first 261 N-terminal amino acids have 

CC been deleted, without affecting the adenyl cyclase activity of the 

CC protein. The truncated adenyl cyclase was obtained by unidirectional 

CC deletion of the full-length cya gene, followed by expression in 

CC protease-deficient E. coli. The truncated enzyme is useful for 

CC synthesis of cAMP and adenosine phosphate derivatives; to separate 

CC chemically synthesised 3'-substituted ATP derivatives and as a 

CC reagent for assay of pyrophosphate, cAMP and calmodulin. 

CC See also Q11825 and WO9106671-A. 

SQ Sequence 1445 AA; 

SQ 188A; 76 R; 58 N; 142D; 0 B; 0 C; 63 Q; 74 E; 0 Z; 214G; 27 H; 

SQ 58 I; 126L; 31 K; 17 M; 34 F; 31 P; 77 S; 67 T; 13 W; 34 Y; 115V; 



Initial Score = 9 Optimized Score = 56 Significance = 3.57 

Residue Identity = 19% Matches = 70 Mismatches = 259 

Gaps Conservative Substitutions 



X 10 20 30 40 50 60 

MTAASMGPVRVAFVVLLALCSRPAVGQNCSGPCRCPDEPAPRCPAGVSLVLDGCGCCRVCAK 



KRTENVGYRHVEL ARVGGV-VEVDTLEHVQH I I GGAGNDS I TGNAHD NFLAGGSGDDRLDGGAGNDTL 

730 X 740 750 760 770 780 

70 80 90 100 110 120 130 

QLGELCTERDPCDPHKGLFCDFGSPAN RK I GVCTAKDGAPC I FGGTVYRSGESFGSSCK YQCTCLDGAV 

■ t « « i i i i i it i* 

ti tit iitt it ii 

VGGEGQNTV I GGAGDDVFLQDLGVWSNGLDGGAGVDTVK YNVHQPSEERLERMG DTG I HADLQKGTV 

790 800 810 820 830 840 850 

140 150 160 170 180 190 

GCMP LCSMDVRLPSPDCPFPR R VKLPGKCCEEWVCDEPK DOT WG — PALAAYRLEDT— FGPDPTM 

l III It lit t t i i t l 

i ill it lit i i iitt 

EK WPALNLFSVDHVKN I ENLHGSRLNDR I AGDD9DNEL WGHDGNDT I RGRGGDD I LRGGLGLDTL YGEDGND 
860 870 880 890 900 910 920 

200 210 220 230 240 250 260 

I RANCLVQTTEWSACSKTCGMG I STR VTNDN ASCRLEK OSRLCM VRPCE ADLEEN I KKGKKC I RTPK I SKP I 



■ til 
• t i t 



I FLODDETVSDD I DGGAGLDT— VDYSAM I HPGR I V APHE YGFG I EADLSREW VRK AS ALG VDY YDN VR 

930 940 950 960 970 980 990 

270 280 290 300 310 320 330 

KFELSGCTSMK TYRAKFCGVCTDGRCCTPHRTTTLPVEFKCPDGEVMKKNMMF I KTCACHYNCPGDND I 

i ■ i i t t iti tit t i t i t 

i till i lit lit i till 

NVENVIGTSMKDVLIGDAOANTLMGBGGDDT-VRGGDGDDLLFGGDGNDMLYGDAGNDT LYGGLGD— DT 

1000 1010 1020 1030 1040 1050 1060 

340 X 
FESLYYRKMYGDMA 

i i 
i i 

LEGGAGNDWFGQTQAREHDVLRGG 
1070 X 1080 



9. US-07-752-427-2 (1-349) 

R08267 Platelet derived growth factor (PDGF) receptor pro 

ID R08267 standard; protein; 1089 AA. 

AC R08267; 

DT 07-MAR-1991 (first entry) 

DE Platelet derived growth factor (PDGF) receptor protein. 
KW Atherosclerosis; fibrotic diseases. 
OS Homo sapiens. 
PN WO9014425-A. 
PD 29-NOV-1990. 
PF 21-MAY-1990; U02849. 
PR 22-MAY-1989; US-355018. 
PA (ZYMO-) ZYMOGENETICS INC. 

PI Kelly JD, Murray MJ; 

DR WPI; 90-375992/50. 

DR N-PSDB; Q06869. 

PT DNA encoding platelet-derived growth factor - used to transform 

PT cells for culturing to detect PDG agonists and antagonists 

PS Claim 1; Fig 1; 30pp; English. 

CC Gene product may be expressed from a transformed cell. It has 

CC utility in detection of PDGF agonist and antagonist analogues, binding 

CC AA, AB and BB isoforms. PDGF agonists may be used to enhance wound 

CC healing, and antagonists may be used to block the effects of PDGF 

CC eg. in treatment of atherosclerosis or fibrotic diseases. 

SQ Sequence 1089 AA; 

SQ 54 A; 44 R; 44 N; 66 D; 0 B; 17 C; 25 Q; 94 E; 0 Z; 58 G; 25 H; 



Initial Score = 6 Optimized Score = 56 Significance = 3.57 

Residue Identity = 19% Matches = 72 Mismatches = 251 

Gaps = 44 Conservative Substitutions = 0 



X 10 20 30 40 50 60 

MTAASMGPVRVAFVVL-LALCSRPAVGQNCSGPCRCPDEPAPRCPAGVSLVLDGCGCCRV CAKQLGEL 

ii i i ■ i i ii ii •• * * 

ll till l fill lilt 

MGTSHP AFLVLGCLLTGLSL I LCQLSLPS I LPNENEK WGLNSSFSLRCFGESEVSWQ YPMSEEESS 
X 10 20 30 40 50 60 

70 80 90 100 110 120 130 

CTERDPCDPHKGLFCDFGSPANRK I GVCT AKDGAPC I FGGTVY RSGESFGSSCKYGCTCLDGAVGCMP 

i ill ttt ii i tti 

i iti tit it i lit 

DVE I RNEENNSGLF VTLLE VSS A — SA AHTGL YTC Y YNHTOTEENELEGRH I Y I Y VPDPD VAF VP 

70 80 90 100 110 120 130 



140 150 160 170 180 190 

LCSMDVRLPSPD CPFPRRVKLPGKCCEEWVCDEFKDGTVVGPAL.AAYR — LEDTFGFDPTM I RA NC 

ti * i*i ii iti i iii * 

t i i iii it tti t iii i 

LGMTDYLV I VEDDDSA 1 1 PCRTTDP- ETPV — TLHNSEG VVP AS YDSROGFNGTFT VGP Y I CEATVKGK 

140 150 160 170 180 190 



200 210 220 230 240 250 260 

LVOT — TEWSACSKTCGMG I STRVTNDNASCRLEKQSRLCMVRPCEADLEENI KKGKKC I RTPK I SKP I KFE 

it it i t iiit t i t 

it It i t iiit i i i 

KFQT I PFNVY ALK AT SELDLEMEALKTVYK-SGET I V VTC AVFNNE WDLQWTYPGE VKGKG I T I LE 

200 210 220 230 240 250 260 

270 280 290 300 310 320 330 

LSGCTSMK TYRAKFCGVCTDGRCCTPHRTTTLPV— EFKCPDGEVMKKNMMF I KTCACHYNCPGDND I FE 

iii ii t t iii it it 

tti t i i i iii it it 

E I K VPS I KLV YTLT VPEAT VKDSGDYECAARBATREVKEMKK VT I SVHEKGF I E I KPTFSQLEAVNLHEVKH 

270 280 290 300 310 320 330 



340 X 
SLYYRKMYGDMA 

F V VE VRAYPPPR I SWLKNNLTL 
340 X 350 



10. US-07-752-427-2 (1-349) 

R06910 Alpha type PDGF receptor deduced from TR4 cDNA clo 

ID R06910 standard; protein; 1089 AA. 

AC R06910; 

DT 16-JAN-1991 (first entry) 

DE Alpha type PDGF receptor deduced from TR4 cDNA clone. 

KW Platelet derived growth factor; T11. 

OS Homo sapiens. 

FH Key Location/Qualifiers 

FT Domain 1..23 

FT /label=signal peptide 

FT Domain 24..524 

FT /label=ligand binding domain 

FT Domain 525..548 

FT /label=transmembrane region 

FT Domain 549..599 

FT /label=juxtamembrane domain 

FT Binding-site 600..627 

FT /label=ATP binding site 

FT Modified-site 849 

FT /label=tyrosine autophosphorylation site 

FT Modified-site 42..44 

FT /label=N-glycosylation_site 

FT Modified-site 76..78 

FT /label=N-glycosylation_site 

FT Modified-site 103..105 

FT /label=N-glycosylation_site 

FT Modified-site 179..181 

FT /label=N-glycosylation_site 

FT Modified-site 353..355 

FT /label=N-glycosylation_site 

FT Modified-site 359..361 

FT /label=N-glycosylation_site 

FT Modified-site 458..460 

FT /label=N-glycosylation_site 

FT Modified-site 468..470 

FT /label=N-glycosylation_site 

PN WO9010013-A. 

PD 07-SEP-1990. 

PF 08-FEB-1990; U00617. 

PR 09-FEB-1989; US-308282. 

PA (USDC ) US SEC OF COMMERCE. 

PI Matsui T, Aaronson SA, Pierce JH; 

DR WPI; 90-230306/38. 

DR N-PSDB; Q05989. 

PT Type alpha platelet-derived growth factor receptor gene - useful 

PT for transforming cells to express novel protein receptor and also 

PT susceptible to genetic engineering. 

PS Claim 7; Fig 3; 64pp; English. 

CC The TR4 clone is the largest cDNA clone related to the T11 genomic 

CC clone, isolated from a library prepd. from human thymus DNA. The 

CC T4 cDNA clone was isolated from a M426 human embryo fibroblast 

CC cDNA library. The coding region can be introduced into the pSV2 

CC gpt vector with a simian sarcoma virus LTR as a promoter and 

CC expressed in a host. The resulting protein is a novel PDGF 

CC receptor designated type alpha (the known receptor is designated 

CC type beta). The polypeptide has a calculated molecular mass of 120 

CC kD and has all the characteristics of a membrane spanning tyrosine 

CC kinase receptor. The extracellular region comprises a hydrophobic 

CC signal peptide and a ligand binding domain which has structural 

CC homology with the PDGF-R/CSF1-R subfamily. Ten Cys residues are 

CC spaced at the same positions as in other receptors of the 

CC subfamily and eight potential N-linked glycosylation sites are also 

CC present. A hydrophobic segment spans the membrane and the 

CC cytoplasmic region comprises a juxtamembrane region, a tyrosine kinase 

CC region split into TK1 and TK2 by a hydrophilic interkinase region 

CC and a hydrophilic C-terminal tail. The TK region includes the 

CC consensus ATP binding sequence (G-X-G-X-X-G...K) and a tyrosine 

CC autophosphorylation site homologous to that of pp60(v-src). 

SQ Sequence 1089 AA; 

SQ 54 A; 44 R; 44 N; 66 D; 0 B; 17 C; 26 Q; 93 E; 0 Z; 58 G; 25 H; 

SQ 70 I; 104L; 69 K; 26 M; 32 F; 51 P; 93 S; 68 T; 13 W; 47 Y; 89 V; 

Initial Score = 6 Optimized Score = 56 Significance = 3.57 

Residue Identity = 19% Matches = 72 Mismatches = 251 

Gaps = 44 Conservative Substitutions = 0 

X 10 20 30 40 50 60 

MTAASMGPVRVAFVVL— LALCSRPAVGQNCSGPCRCPDEPAPRCPAGVSLVLDGCGCCRV CAKQLGEL 

ti iiiii iiit ti» i 

it iiiii tiii ii i i 

MGTSHPAFLVLGCLLTGLSL I LCQLSLPS I LPNENEK VVQLNSSFSLRCFGESEVSWGYPMSEEESS 

X 10 20 30 40 50 60 

70 80 90 100 110 120 130 

CTERDPCDPHK GLFCDFGSP ANRK I GVCTAKDGAPC I FGGTVY RSGESFOSSCK YQCTCLDG A VGCMP 






DVE I RNEENNSGLF VT^CEVSS A — S A AHTGLYTCY YNHTQTEBWELEGRH I Y I Y VPDPDV AFVP 

70 80 90 100 110 120 130 



